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<110> NORPHARMA SPA 

<1^0> Recombinant bacterial strains for the Production of 

natural nucleosides and modified analogues thereof 

<130> 99DC26E 



<140> PCT/EP99/10416 
<141> 1999-12-23 

<150> MI98A002792 
<151> 1998-12-23 

<160> 1:- 

<1 7 0 ^ Patentln Ver. 2.1 

<-2:0> 1 
<211> 34 4 4 
<212> DIIA 

<213> Artificial Sequence 
220> 



,:: 3 > Description of Artificial Sequence: Plasmid 



22 0> 

221> qene 

222> (243) . . (1021) 

22 3> udp 



gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 60 
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct 120 
,-actcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat 180 
tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg aattcgagct 240 
c.gtaccatc catgtccaag tctgatgttt ttcatctcgg cctcactaaa aacgatttac 300 
aaggggctac gcttgccatc gtccctggcg acccggatcg tgtggaaaag atcgccgcgc 360 
tgatggataa gccgcttaag ctggcatctc accgcgaatt cactacctgg cgtgcagagc 420 
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tggatggtaa acctgttatc 
ctgttgaaga gctggcacag 
ctattcagcc gcatattaat 
atggcgcgag cctgcacttc 
cgactgcgct ggttgaagct 
cttcttctga taccttctac 
ttcgtcactt taaaggttct 
tggaatctgc aaccctgctg 
cgggtgttat cgttaaccgc 
ccgaaagcca tgcggtgaaa 
gtcgacctgc aggcatgcaa 
aaaccctggc gttacccaac 
taatagcgaa gaggcccgca 
atggcgcctg atgcggtatt 
gtgcactctc agtacaatct 
aacacccgct gacgcgccct 
tgtgaccgtc tccgggagct 
gagacgaaag ggcctcgtga 
ttcttagacg tcaggtggca 
tttctaaata cattcaaata 
ataatattga aaaaggaaga 
ttttgcggca ttttgccttc 
tgctgaagat cagttgggtg 
gatccttgag agttttcgcc 



Norphll . app 

gtctgctcta ccggtatcgg cggcccgtct acctctattg 480 



ctgggcattc gcaccttcct gcgtatcggt acaacgggcg 540 
gtgggtgatg tcctggttac cacggcgtct gtccgtctgg 600 
gcaccgctgg aattcccggc tgtcgctgat ttcgaatgta 660 
gcgaaatcca ttggcgcgac aactcacgtt ggcgtgacag 720 
ccaggtcagg aacgttacga tacttactct ggtcgcgtag 780 
atggaagagt ggcaggcgat gggcgtaatg aactatgaaa 840 
accatgtgtg caagtcaggg cctgcgtgcc ggtatggtag 900 
acccagcaag agatcccgaa tgctgagacg atgaaacaaa 960 
atcgtggtgg aagcggcgcg tcgtctgctg taattctctt 1020 
gcttggcact ggccgtcgtt ttacaacgtc gtgactggga 1080 
ttaatcgcct tgcagcacat ccccctttcg ccagctggcg 1140 
ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga 1200 
ttctccttac gcatctgtgc ggtatttcac accgcatatg 1260 
gctctgatgc cgcatagtta agccagcccc gacacccgcc 1320 
gacgggcttg tctgctcccg gcatccgctt acagacaagc 1380 
gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc 1440 
tacgcctatt tttataggtt aatgtcatga taataatggt 1500 
cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt 1560 
tgtatccgct catgagacaa taaccctgat aaatgcttca lt-J.0 
gtatgagtat tcaacatttc cgtctcgccc ttattccctt 1680 
ctgtttttgc tcacccagaa acgctggtga aagtaaaaga 1740 
-acgagtggg ttacatcgaa ctggatctca acagcggtaa 1800 
ccgaagaacg tr.ttccaatg atgagcactt ttaaagttct 1860 
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gctatgtggc gcggtattat cccgtattga cgccgggcaa gagcaactcg gtcgccgcat 19.0 
acactattct cagaatgact tggttgagta ctcaccagtc acagaaaagc atcttacgga 1980 
tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata acactgcggc 2040 
caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt tgcacaacat 2100 
gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa 2160 
cgacgagcgt gacaccacga tgcctgtagc aatggcaaca acgttgcgca aactattaac 2220 
tggcgaacta cttactctag cttcccggca acaattaata gactggatgg aggcggataa 2280 
agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg ctgataaatc 2340 
tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag atggtaagcc 2400 
ctcccgtatc gtagttatct acacgacggg gagtcaggca actatggatg aacgaaatag 2460 
acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag accaagttta 2520 
ctcatatata ctttagattg atttaaaact tcatttttaa tttaaaagga tctaggtgaa 2580 
gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc 2640 
gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat 2700 
ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga 2760 
gctaccaact ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt 2820 
ccttctagtg tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata 2830 
cctcgctctg ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac 2940 
cgggttggac tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg 3000 
ttcgtgcaca cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg 3060 
tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag 3120 
cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct 3180 
ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc 3240 
aggggggcgg agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt 2200 
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ttactggcct tttgctcaca tgttctttcc tgcgttatcc cctgattctg tggataaccg 33,0 
ta.taccgcc tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga 3420 

3444 

gt.cisgtga.gc gaggaagcgg aaga 



<213> 2 
•011> 5556 
012> DNA 

Oi3> Artificial Sequence 



Description of Artificial Sequence: Plasmid 



v::o> 

s"21> gene 

<:•::> (243) . . (io:i) 

<223> udp 
O2 0> 

--2 21^ gene 

.. 222 > (1483) . . (2333) 

-:223> tetracycline resistance 

•;4 00> 2 



:gcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 60 
coacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct 120 
cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat 130 
tawaccgga taacaatttc acacaggaaa cagctatgac catgattacg aattcgagct 240 
cwaccatc catgtccaag tctgatgttt ttcatctcgg cctcactaaa aacgatttac 300 
aag.g.ctac gcttgccatc gtccctggcg acccggatc, tgtggaaaag atcgccgcgc 360 
; ,atagataa gccggttaag ctggcatctc accgcgaatt cactacctgg cgtgcagagc .20 
-ggatggtaa acctgttatc gtctgctcta ccggtatcg, cggcccgtct acctctatt, 4 80 
-tg,tgaaga gctggcacag ctgggcattc gcaccttcc, gcgtatcggt acaacgggc, 540 
.-.tattcagcc gcatattaat gtgggtgatg tcctggttac cacggcgtct gtccgtctgg 600 
atggcgcgag ,ctg,acttc gcaccgctgg aattcccgg, tgtcgctgat ttcgaatgta 660 
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cgactgcgct ggttgaagct gcgaaatcca ttggcgcgac aactcacgtt ggcgtgacag 720 
cttcttctg a tacct tctac ccaggtcagg aacgttacga tacttactct ggtcgcgtag 780 
ttcgtcactt taaaggttct atggaagagt ggcaggcgat gggcgtaatg aaotatgaaa 840 
tggaatctgc aaccctgctg accatgtgtg caagtcaggg cctgcgtgcc ggtatggtag 900 
cgggtgttat cgttaaccgc acccagcaag agatcccgaa tgctgagacg atgaaacaaa 960 
ccgaaagcca tgcggtgaaa atcgtggtgg aagcggcgcg tcgtctgctg taattctctt 1020 
gtcgacctgc aggcatgcaa gctttatgct tgtaaaccgt tttgtgaaaa aatttttaaa 1080 
ataaaaaagg ggacctctag ggtccccaat taattagtaa tataatctat taaaggtcat 1140 
tcaaaaggtc atccaccgga tcagcttagt aaagccctcg ctagatttta atgcggatgt 1200 
tgcgattact tcgccaacta ttgcgataac aagaaaaagc cagcctttca tgatatatct 12 60 
cccaatttgt gtagggc.ta ttatgcacgc ttaaaaataa taaaagcaga cttgacctga 1320 
tagtttggct gtgagcaatt atgtgcttag tgcatctaac gcttgagtta agccgcgccg 1330 
cgaagcggcg tcggcttgaa cgaattgtta gacattattt gccgactacc ttggtgatct 1440 
cgcctttcac gtagtggaca aattcttcca actgatctgc gcgccgagat gcgccgcgtg 1500 
cggctgctgg agatggcgga cgcgatggat atgttctgcc aagggttggt ttgcgcattc 1560 
acagttctcc gcaagaattg attggctcca attcttggag tggtgaatcc gttagcgagg 1620 
tgccgccggc ttccattcag gtcgaggtgg cccggctcca tgcaccgcga cgcaacgcgg 1680 
ggaggcagac aaggtatagg gcggcgccta caatccatgc caacccgttc catgtgctcg 1740 
ccgaggcggc ataaatcgcc gtgacgatca gcggtccagt gatcgaagtt aggotggtaa 1800 
gagccgcgag cgatccttga agctgtccct gatggtcgtc atctacctgc ctggacagca i860 
tggcctgcaa cgcgggcatc cogatgccgc cggaagcgag aagaatcata atggggaagg 1920 
ccatcca-jcc tcjcgtcgcg aacgccagca agacg-.agcc cagcgcgtcg gccgccatgc i«80 
cggcga.aat ggcctgcttc tcgccgaaac gtttggtggc gggaccagtg acgaaggctt 2040 
gagcgagggc gtgcaagatt ccgaataccg caagcgacag gccgatcatc gtcgcgctcc 2100 
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g t=ctc g cc g aaaatgaccc agagc g ct g c cg g cacct g t cct.cgagtt 2!60 
g catgataaa g aa g aca g tc ataa g t g c g g c g ac g ata g t catgccccgc g cccacc gg a 2220 
ag g a 9 ct g ac tgggttgaa, gctctcaagg gcatcggtc, acgctctccc ttat g c g act 2280 
cct g catta g gaagcagccc a g ta g ta S gt tgaggccgtt g a g ca=c g =c g cc g caa gg a ,340 
atggtgoat, caa g9 a g at g g c g cc=aaca g tcccc= gg c cac gggg cct g ccaccatac 2400 
CC ac g ccgaa acaa g c g ct= atgagcccga agtggcgagc ccgatcttcc ccatc gg t g a 2460 
t g tc gg c g at at.ggcgcca gcaaccgcac ct g t gg c g cc g gt g at g cc g g ccacgat g c 2520 
gtccggcgta gaggatccac aggacgggtg t gg tcgccat gatcgcgta, tc g atagtg g 25,0 
,- t ccaa gtag cgaagcgagc aggactgggc g g c gg ccaaa gcggtcggac agtgctccga 2640 
gaacg g g tg c gcatagaaat tgcatcaacg catatagcgc tagcagcacg =c.t.,tgac 2700 
tggcgatact gtc gg aat gg ac g atatccc g caagaggcc cggcagtacc g gcataacca 27,0 
ag.ct.tgcc tacagcatcc agggtg.cgg tgccgaggat g a= g at g agc gcattgttag 2820 
atttcataca c gg t g cctga ctg= g tta g c a.tttaactg tg.taaacta cc g cattaaa 2630 
g.-tcatgcgg atcagtga.g gtttgcaact gcgg g tcaa g g atctggatt tcgatcacgg 2,40 
cacgatcatc gtgcggg.g, g caa ggg ctc caaggatcgg gccttgatgt tacccgagag 3000 
ctt gg caccc agcctgcgcg agcaggggaa tt g a t ccggt ggat g acctt tt g aat g acc 3060 
tttaatagat tatattacta attaattggg gacccta.ag gtcccctttt ttattttaaa 3,20 
aatt-ttrca caaaacgatt tacaa g cata aagcttogca ctggccgtcg ttttaca.cg 3.80 
tcotcactgg gaaaaccctg gcgttaccca acttaatc g = cttgc.gcac atcccccttt 3240 
,gcca„ctgg cct.at.gcg aa g a g gccc g caccgatcgc ccttcccaac a g tt g c g cag 3,00 
cctga.t,gc gaatggcgcc tgatgcggta ttttctcctt acgcatctgt gcggtatttc 3,,0 
acaccgcat. tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagcc 2,20 
ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc ,480 

, tlM ,. ca tctc-gigag ctgcatatgt caga 3g tttt caccgtcatc 2.540 

ttacagacaa gctgtga^cg tctc^^J * 
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accgaaacgc gcgagacgaa agggcctcgt gatacgccta tttttatagg ttaatgtcat 3600 
gataataatg gtttcttaga cgtcaggtgg cacttttcgg ggaaatgtgc gcggaacccc 3660 
tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 3720 
ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 3780 
ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt 3840 
gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 3900 
caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 3960 
ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 4020 
cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 4080 
gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 4140 
taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 4200 
tt.gcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 4260 
agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 4 320 
caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 4380 
ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 4440 
tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc 4500 
agatagtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga 4560 
tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc 4620 
agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag 4680 
gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc 4740 
gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt 4800 
tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt 4860 
gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 4920 
accaaatact gtccttctag tgtagccgta gttaggccac cacttcaaga actctgtagc 4980 
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accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa 5040 
gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg 5100 
ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag 5160 
atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag 5220 
gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa 5230 
cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 5340 
gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg 5400 
gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc 5460 
tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac 5520 
cgagcgcagc gagtcagtga gcgaggaagc ggaaga 55d6 



<-:io> 3 

Oil> 3383 
<.212> DMA 

<. 213> Artificial Sequence 

<223- Description of Artificial Sequence: Plasrnid 



<221"> gene 

<222 - (231) . . (960) 

•;22 3> deoD 

■-4 00~> 3 



gsgcccLta cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 60 
rgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct 120 

cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat 180 
tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg aattcttcca 240 
tggctacccc acacattaat gcagaaatgg gcgatttcgc tgacgtagtt ttgatgccag 300 
g.gacccgct gcgtgcgaag tatattgctg aaactttcct tgaagatgcc cgtgaagtga 360 
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acaacgttcg cggtatgctg ggcttcaceg gtacttacaa aggccgcaaa atttccgtaa 420 
tgggtcacgg tatgggtatc ccgtcctgct ccatctacac caaagaactg atcaccgatt 480 
tcggcgtgaa gaaaattatc cgcgtgggtt cctgtggcgc agttctgccg cacgtaaaac 540 
tgcgcgacgt cgttatcggt atgggtgcct gcaccgatto caaagttaac cgcatcogtt 600 
ttaaagacca tgactttgcc gctatcgctg acttcgacat ggtgcgtaac gcagtagatg 660 
cagctaaagc actgggtatt gatgctcgcg tgggtaacct gttctccgct gacctgttct 720 
actctccgga cggcgaaatg ttcgacgtga tggaaaaata cggoattctc ggcgtggaaa 7B0 
tggaagcggc tggtatctac ggcgtcgctg cagaatttgg cgcgaaagcc ctgaccatct 840 
gcaccgtatc tgaccacatc cgcactcacg agcagaccac tgccgctgag cgtcagacta 500 
ccttcaacga catgatcaaa atcgcaotgg aatccgttct gctgggcgat aaagagtaag 960 
tcgacctgca ggcatgcaa, cttggcactg gccgtcgttt ta.aacgtcg tgactgggaa 1020 
aaccctggcg ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt 1080 
aatagcgaag aggcccgcac cgatcgccct tcocaacagt tgcgcagcct gaatggcgaa 1140 
tggcgcctga tgcggtattt tctccttacg catctgtgcg gtatttcaca ccgcatatgg 1200 
tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagccccg acacccgcca 1260 
acacccgctg acgcgccctg acgggcttgt ctgctcccgg catccgctta cagacaagct 1320 
g.gaccgtct ccgggagctg catgtgtcag aggttttcac cgtcatcacc gaaacgcgcg 1380 
agacgaaagg gcctcgtgat acgcctattt ttatagg.ta atgtcatgat aataatggtt 1440 
tcttagacgt caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt 1500 
ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa 1560 
taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattccottt 1620 
tttgcggcat tttgccttcc tgttr.ttgct cacccagaaa cgctggtgaa agtaaaagat 1680 
gctgaaga,c a,,tgggtgc aogagtgggt tacatcgaac tgga.ctcaa cagcggtaag 1740 
atccttgaga gt.ttcgcc, cgaagaacgt tttccaatga tgagcacttt taaagttctg 1800 
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ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata 1860 
cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat 1920 
ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc 1980 
aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg 2040 
ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac 2100 
gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa actattaact 2160 
ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa 2220 
gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct 2280 
ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc 2340 
tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga 2400 
cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac 24 60 
tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag 2520 
atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg 2530 
tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc 2640 
tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag 2700 
ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc 2760 
cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac 2820 
ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc 2830 
gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt 2940 
tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt 3000 
gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc 3060 
ggcagggtcg gaacaggaga gcgcacgagg gagctrccag ggggaaacgc ctggtatctt 1-120 
tatag.cctg tcgggttt=g ccacctctga cttgagcgtc gatttttgtg atgctcgtca 2180 
ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt 3240 
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tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt 3300 
attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag 3360 

3383 

tcagtgagcg aggaagcgga aga 



<21C> 4 
<21i> 5495 
<212> DNA 

<213> Artificial Sequence 



<. _ _ 

< 22 



3> Description of Artificial Sequence: Plasmid 



<220> 

<221> gene 

<2 "(231) . . (960) 

n.22 3-* deoD 

v220"* 

v221 N gene 

v222 > (1423) . . (2822) 

-.223 N tetracycline resistance 



gcg!4Lta cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 60 
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct 120 
cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat 180 
tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg aattcttcca 240 
tggctacccc acacattaat gcagaaatgg gcgatttcgc tgacgtagtt ttgatgccag 300 
gccacccgct gcgtgcgaag tatattgctg aaactttcct tgaagatgcc cgtgaagtga 360 
acaacgttcg cggtatgctg ggcttcaccg gtacttacaa aggccgcaaa atttccgtaa 420 
tggqtcacgg tatgggtatc ccgtcctgct ccatctacac caaagaactg atcaccgatt 480 
tcggcgtgaa gaaaattatc cgcgtgggtt cctgtggcgc agttctgccg cacgtaaaac 540 
tgcgcgacgt cgttatcggt atgggtgcct gcaccgattc caaagttaac cgcatccgtt 600 
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ttaaagacca tgactttgcc gctatcgctg acttcgacat ggtgcgtaac gcagtagatg 660 
cagctaaagc actgggtatt gatgctcgcg tgggtaacct gttctccgct gacctgttct 720 
actctccgga cggcgaaatg ttcgacgtga tggaaaaata cggcattctc ggcgtggaaa 780 
tggaagcggc tggtatctac ggcgtcgctg cagaatttgg cgcgaaagcc ctgaccatct 840 
gcaccgtatc tgaccacatc cgcactcacg agcagaccac tgccgctgag cgtcagacta 900 
ccttcaacga catgatcaaa atcgcactgg aatccgttct gctgggcgat aaagagtaag 960 
tcgacctgca ggcatgcaag ctttatgctt gtaaaccgtt ttgtgaaaaa atttttaaaa 1020 
taaaaaaggg gacctctagg gtccccaatt aattagtaat ataatctatt aaaggtcatt 1080 
caaaaggtca tccaccggat cagcttagta aagccctcgc tagattttaa tgcggatgtt 1140 
gcgattactt cgccaactat tgcgataaca agaaaaagcc agcctttcat gatatatctc 1200 
ccaatttgtg tagggcttat tatgcacgct taaaaataat aaaagcagac ttgacctgat 1260 
agtttggctg tgagcaatta tgtgcttagt gcatctaacg cttgagttaa gccgcgccgc 1320 
gaagcggcgt cggcttgaac gaattgttag acattatttg ccgactacct tggtgatctc 1380 
gcctttcacg tagtggacaa attcttccaa ctgatctgcg cgccgagatg cgccgcgtgc 1440 
ggctgctgga gatggcggac gcgatggata tgttctgcca agggttggtt tgcgcattca 1500 
cagttctccg caagaattga ttggctccaa ttcttggagt ggtgaatccg ttagcgaggt 1560 
gccgccggct tccattcagg tcgaggtggc ccggctccat gcaccgcgac gcaacgcggg 1620 
gaggcagaca aggtataggg cggcgcctac aatccatgcc aacccgttcc atgtgctcgc 1680 
cgaggcggca taaatcgccg tgacgatcag cggtccagtg atcgaagtta ggctggtaag 1740 
agccgcgagc gatccttgaa gctgtccctg a.ggtcgtca tctacctgcc tggacagcat 1800 
ggcctgcaac gcgggcatcc cgatgccgcc ggaagcgaga agaatcataa tggggaaggc 18 60 
catccagcct cgcgtcgcga acgccagcaa gacgtagccc agcgcgtcgg ccgccatgcc 1920 
ggcgataatg gcctgcttct cgccgaaacg tttggtggcg ggaccagtga cgaaggcttg 1980 
agcgagggcg tgcaagattc cgaataccgc aagcgacagg ccgatcatcg tcgcgctcca 2040 
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gcgaaagcgg tcctcgccga aaatgaccca gagcgctgcc ggcacctgtc ctacgagttg 2100 
catgataaag aagacagtca taagtgcggc gacgatagtc atgccccgcg cccaccggaa 2160 
ggagctgact gggttgaagg ctctcaaggg catcggtcga cgctctccct tatgcgactc 2220 
ctgcattagg aagcagccca gtagtaggtt gaggccgttg agcaccgccg ccgcaaggaa 2280 
tggtgcatgc aaggagatgg cgcccaacag tcccccggcc acggggcctg ccaccatacc 2340 
cacgccgaaa caagcgctca tgagcccgaa gtggcgagcc cgatcttccc catcggtgat 2400 
gtcggcgata taggcgccag caaccgcacc tgtggcgccg gtgatgccgg ccacgatgcg 2460 
tccggcgtag aggatccaca ggacgggtgt ggtcgccatg atcgcgtagt cgatagtggc 2520 
tccaagtagc gaagcgagca ggactgggcg gcggccaaag cggtcggaca gtgctccgag 2580 
aacgggtgcg catagaaatt gcatcaacgc atatagcgct agcagcacgc catagtgact 2640 
ggcgatgctg tcggaatgga cgatatcccg caagaggccc ggcagtaccg gcataaccaa 2700 
gcctatgcct acagcatcca gggtgacggt gccgaggatg acgatgagcg cattgttaga 2760 
tttcatacac ggtgcctgac tgccttagca atttaactgt gataaactac cgcattaaag 2820 
ctcatccgga tcagtgaggg tttgcaactg cgggtcaagg atctggattt cgatcacggc 2880 
acgatcatcg tgcgggaggg caagggctcc aaggatcggg ccttgatgtt acccgagagc 2940 
ttggcaccca gcctgcgcga gcaggggaat tgatccggtg gatgaccttt tgaatgacct 3C00 
ttaatagatt atattactaa ttaattgggg accctagagg tccccttttt tattttaaaa 3060 
attttttcac aaaacggttt acaagcataa agcttggcac tggccgtcgt tttacaacgt 3120 
cgtgactggg aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca tccccctttc 3180 
gccagctggc gtaatagcga agaggcccgc accgatcgcc cttcccaaca gttgcgcagc 3240 
ctgaatggcg aatggcgcct gatgcggtat tttctcctta cgcatctgtg cggtatttca 3300 
caccgcatat cgtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagccc 3360 
cgacacccgc raacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct 3420 
tacagacaag ctgtgaccgt ctccggqagc tgcatgtgtc agaggttttc accgtcatca 3430 
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ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat ttttataggt taatgtcatg 3540 
ataataatgg tttcttagac gtcaggtggc acttttcggg gaaatgtgcg cggaacccct 3600 
atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca ataaccctga 3660 
taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc 3720 
cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg 3780 
aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga actggatctc 3840 
aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact 3900 
tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca agagcaactc 3960 
ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt cacagaaaag 4020 
catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac catgagtgat 4080 
aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct aaccgctttt 4140 
ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa 4200 
gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac aacgttgcgc 4260 
aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat agactggatg 4 320 
gacgcggata aagttgcagg accacttctg cgctcggccc ttccggctgg ctggtttatt 4380 
gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc actggggcca 4440 
gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc aactatggat 4500 
gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg gtaactgtca 4560 
gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta atttaaaagg 4 620 
atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg tgagttttcg 4 680 
ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt 4740 
ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg 4800 
ccggatcaag agctaccaac r=tttttccg aaggtaactg gcttcagcag agcgcagata 4860 
ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca 4920 
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cc(1 ,,t.c.t .cctcctct gctaatcct, tta=ca g t gg ct g ct g cca g t gg , g ataa g 

t „ £ ,t=tta cc gggttgg a c t caa,ac ? a ta g ttacc gg ataa gg c g ca g c gg tc ggg c ,0,0 
M aa, ggggg gt tc gtg cac aca g ccca g c tt gg a g c 9 aa c g accta=ac c g aact g a g a 5100 
ta „ t ac„= 9 t 9 ac,ctat 9 a g aaa gCg c= ac g c t tccc g aa ggg a g aaa gg c gg aca gg 5160 
tat,c M taa „c gg ca ggg t c M aaca g9 . «c,cac,a ggg a g cttcc a ggggg aaac 5220 
g ,, tggt atc tttatagtcc t g tc g g,ttt c 9 c=.=ct=t g ac t t g a g c g tc g attttt g 5280 
tg a Cg ctc g t ca ggggggCg g a g cc t a tgg aaaaac g cca g caac g c gg c ctttttac,, 53,0 
tr ,,- tgg ==t « t ,ct g ,cc tttt g =teac at g «=tttc ct g <= g tt.tc ==ct g attct 5,00 
grM a t ,.=c 9 tattacc g c cttt g a g t g a g =t,atacc g c t c g cc 9 ca g cc g aac g acc 5,60 

5495 

gagcgcagcg agtcagtgag cgaggaagcg gaaga 



^211- 4189 
-.21 2-- DHA 

<-].?/.• Artificial Sequenc 



V>:3: Description of Artificial Sequence: Plasmxd 
<220> 

<221> gene 

<222> (243) . . (1021) 

<.2 2 3> udp 

< 2 2 0 > 

*.""!> qene 

<2:.2> '(1037) . . (1766) 
v.22 3> deoD 

r-° 3 i: :;-caa t a c g caaaoc g c ctctccccgc g c g tt gg cc g attc.tt.at g c.,ct„ca CO 

-_ aggtf ,- ccc g act gg a aa g c ggg ca g t g a g = g =aac g caattaat g t g a g tta g ct 120 
-',-~at,a g 3 cacccca gg =tttacac« tat g cttc = g g ctc g ta tg t t gtg t gg aat 180 
, 3tgag c, g a taacaatttc acaca gg aaa c. g =ta t3 ,= ca tg attac g aattc.a.ct 2,0 
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cggtacoatc catgtccaag tctgatgttt ttcatctcgg cctoactaaa aacgatttac 300 
aaggggctac gcttgccatc gtccctggcg acccggatcg tgtggaaaag atcgccgcgc 360 
tgatggataa gccggttaag ctggcatctc accgcgaatt cactacctgg cgtgcagagc 420 
tggatggtaa acctgttatc gtctgctcta ccggtatcg, oggcccgtct acctctattg 480 
ctgttgaaga gctggcacag ctgggcattc gcaccttcct gcgtatcggt acaacgggcg 540 
ctattcagcc gcatattaat gtgggtgatg tcctggttac cacggcgtct gtccgtctgg 600 
atogcgcgag cctgcacttc gcaccgctgg aattcccggc tgtcgctgat ttcgaatgta 660 
cgactgcgct ggttgaagct gcgaaatcca ttggcgcgac aactcacgtt ggcgtgacag 720 
cttcttctga taccttctac ccaggtcagg aacgttacga taottactct ggtcgcgtag 780 
trcgtcactt taaaggttct atggaagagt ggcaggcgat gggcgtaatg aaotatgaaa 840 
tggaatctgc aaccotgc.g accatgtgtg caagtcaggg cctgcgtgcc ggtatggtag 900 
cgggtgttat cgttaaccgc acccagcaag agatcccgaa tgctgagacg atgaaacaaa 960 
ccgaaagcca tgcggtgaaa atcgtggtgg aagcggcgcg tcgtctg.tg taattctctt 1020 
gtcgactagc aggaggaatt c«ccatggc taccccacac attaatgcag aaatgggcga 1080 
«t=,ct,ac gtagttttga tgccaggcga cccgctgcgt gcgaagtata ttgcigaaac 1140 
tttccttgaa gatgcccgtg aagtgaacaa cgttcgcggt atgctgggct tcaccggtac 1200 
ttacaaaggc cgcaaaattt ccgtaatggg tcacggtatg ggtatcccgt cctgctccat 1260 
ctacaccaaa gaactgatca ccgatttcgg cgtgaagaaa attatccgcg tgggttcctg 1320 
tggcgcagtt ctgccgcacg taaaactgcg cgacgtcgtt atcggtatgg gtgcctgcac 1330 
cjattcca.a gttaaccgca tccgttttaa agaccatgac tttgccgcta tcgctgactt 1440 
cgacatggtg cgtaacgcag tagatgcagc taaagcactg ggtattgatg ctcgcgtggg 1500 
taacctgttc tccgctgaco tgttctactc tccggacggc gaaatgt.cg acgtgatgga 1560 
aaaatacggc att.tcggcg tggaaatgga agcggctggt at=tac, 3 cg tcgctgcaga 1620 
atttggcgcg aaagccctga ccatctgcac cgtatctgac cacatccgca ctcacgagca 1680 
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gaccactgcc gctgagcgtc agactacctt caacgacatg atcaaaatcg cactggaatc 1740 
cgttctgctg ggcgataaag agtaagtcga cctgcaggca tgcaagcttg gcactggccg 1800 
tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag I860 
cacatccccc tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc 1920 
aacagttgcg cagcctgaat ggcgaatggc gcctgatgcg gtattttctc cttacgcatc 1930 
tgtgcggtat ttcacaccgc atatggtgca ctctcagtac aatctgctct gatgccgcat 2040 
agttaagcca gccccgacac ccgccaacac ccgctgacgc gccctgacgg gcttgtctgc 2100 
tcccggcatc cgcttacaga caagctgtga ccgtctccgg gagctgcatg tgtcagaggt 2160 
tttcaccgtc atcaccgaaa cgcgcgagac gaaagggcct cgtgatacgc ctatttttat 2220 
aggttaatgt catgataata atggtttctt agacgtcagg tggcactttt cggggaaatg 2280 
tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga 2340 
gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 2400 
atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc 2460 
cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca 2520 
tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 2580 
caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg 2640 
ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac 2700 
cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 2760 
taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg 2820 
agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac 2880 
cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg 2940 
caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat 3000 
taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg 3060 
ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg 2120 
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cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 3180 
aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 3240 
attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 3300 
tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 3360 
aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 3420 
gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 3430 
cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 3540 
gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc caccacttca 3600 
agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg 3660 
ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 3720 
cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 3780 
acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 3840 
gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 3900 
ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 3960 
agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 4020 
cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt 4080 
tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc 4140 
gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaaga 4189 



<- 2 1 ("i :•• 6 
<-.211> 6301 
*. 212"- DNA 

< -i 3-. Artificial Sequence 

?:5:3'- Description of Artificial Sequence: Plasmid 
■::2J0 • 
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<221> gene 
<.222> (243) . . (1021) 
^ ■ 



<^^^:>^ udp 



<2 21> gene 

<222> (1037) . . (1766) 
<223> deoD 

<220> 

<2"*1> gene 

<222> (2229) . . (3628) 

<223> tetracycline resistance 

a'gcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 60 
cgacaggttt cccgactgga aagcgggcag tgagcgcaac goaattaatg tgagttagct 120 
cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat 180 
tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg aattcgagct 240 
cg.raccatc catg.ccaa, tctgatgttt ttcatctcgg cctcactaaa aacgatttac 300 
aaggggctac gcttgccatc gtccctggcg acccggatcg tgtggaaaag atcgccgcgc 360 
cgatggataa gccggttaag ctggcatctc accgcgaatt cactacctgg cgtgcagagc 420 
tggatggtaa acctgttatc gtctgctcta ccggtatcgg cggcccgtct acctctattg 430 
ccgttgaaga gctggcacag ctgggcattc gcaccttcct gcgtatcggt acaacgggog 540 
etattcagcc gcatattaat gtgggtgatg tcctggttac cacggcgtct gtccgtctgg 600 
atggcgcgag cctgcacttc gcaccgctgg aattcccggc tgtcgctgat ttcgaatgta 660 
cgactgcgct ggttgaagct gcgaaatcca ttggcgcgac aactcacgtt ggcgtgacag 720 
^tcttctga taccttctac ccaggtcagg aacgttacga tacttactct ggtcgcgtag 760 
t,cgtcactt taaaggttct atggaagagt ggcaggcgat gggcgtaatg aactatgaaa 640 
rggaatctgc aaccctgctg accatgtgtg caagtcaggg cctgcgtgcc ggtatggtag 900 
,gggtgttat cgttaaccgc acccagcaag agatcccgaa tgctgagacg atgaaacaaa 960 
,cgaaagcca tgcggtgaaa atcgtggtgg aagcgg=gcg tcgtctgctg taattctctt 1020 
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gtcgactagc a,ga gg aatt cttccatggc taceccacac attaatgcag aaatgggcga 1080 
tttcgctgac gtagttttga tgccaggcga cccgctgcgt gcgaagtata ttgctgaaac 1140 
tttccttgaa gatgcccgtg aagtgaacaa cgttcgcggt atgctgggct tcaccggtac 1200 
ttacaaaggc cgcaaaattt ccgtaatggg tcacggtatg ggtatcccgt cctgctccat 1260 
ctacaccaaa gaactgatca ccgatttcgg cgtgaagaaa attatccgcg tgggttcctg 1320 
tgccgcagtt ctgccgcacg taaaactgcg cgacgtcgtt atcggtatgg gtgcctgcac 1360 
cgattccaaa gttaaccgca tccgttttaa agaccatgac tttgccgcta tcgctgactt 1440 
cgacatggtg cgtaacgcag tagatgcagc taaagcactg ggtattgatg ctcgcgtggg 1.00 
taacctgttc tccgctgacc tgttctactc tccggacggc gaaatgttcg acgtgatgga 1560 
aaaatacggc attctcggcg tggaaatgga agcgg,tggt atctacggcg tcgctgcaga 1620 
atttg , cgcg aaagccctga ccatctgcac cgtatctgac cacatccgca ctcacgagca 1630 
gaccac-.gcc gctgagcgtc agactacctt caaggacatg atcaaaatcg cactggaatc 1740 
cgttctgctg ggcgataaag agtaagtcga cctgcaggca tgcaagcttt atgcttgtaa 1800 
accgttttgt gaaaaaattt ttaaaataaa aaaggggacc tctagggtcc ccaattaatt 1860 
agtaatataa tctattaaag gtcattoaaa aggtcatcca ccggatcagc ttagtaaagc 1920 
cctcgctaga ttttaatgcg gatgttgcga ttacttcgcc aactattgcg ataacaagaa 1930 
aaagccagcc tttcatgata tatctcccaa tttgtgtagg gcttattatg cacgcttaaa 2040 
aataataaaa gcagacttga cctgatagtt tggctgtgag caattatgtg cttagtgcat 2100 
ctaacgcttg agttaagccg cgccgcgaag cggcgtcggc ttgaacgaat tgttagacat 2160 
tatttgccga ctaccttggt gatctcgoct ttcacgtagt ggacaaattc ttccaactga 2220 
tctgcgcgcc gagatgcgcc gcgtgcggct gctggagatg gcggacgcga tggatatgtt 2280 
ctgccaaggg ttggtttgcg cattcaoagt tctccgcaag aattgattgg ctccaattct 2 340 
tggagtggtg aatccgttag cgaggtgccg ccggc.tcca ttcaggtcga gg.ggcccgg ,400 

rH,i?-aaaqt ataaggcqgc gcctacaatc 2460 
rtccatgcac cgcgacgcaa cgcgg^gag^ c d ga _aa^ j ^ ^ 
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catgccaacc cgttccatgt gctcgccgag gcggcataaa tcgccgtgac gatcagcggt 2520 
ccagtgatcg aagttaggct ggtaagagcc gcgagcgatc cttgaagctg tccctgatgg 2530 
tcgtcatcta cctgcctgga cagcatggcc tgcaacgcgg gcatcccgat gccgccggaa 2640 
gcgagaagaa tcataatggg gaaggccatc cagcctcgcg tcgcgaacgc cagcaagacg 2700 
tagcccagcg cgtcggccgc catgccggcg ataatggcct gcttctcgcc gaaacgtttg 2760 
gtggcgggac cagtgacgaa ggcttgagcg agggcgtgca agattccgaa taccgcaagc 2820 
gacaggccga tcatcgtcgc gctccagcga aagcggtcct cgccgaaaat gacccagagc 2880 
gctgccggca cctgtcctac gagttgcatg ataaagaaga cagtcataag tgcggcgacg 2940 
atagtcatgc cccgcgccca ccggaaggag ctgactgggt tgaaggctct caagggcatc 3000 
ggtcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 3060 
ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 3120 
ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 3180 
cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 3240 
gcgccggtga tgccggccac gatgcgtccg gcgtagagga tccacaggac gggtgtggtc 3300 
gccatgatcg cgtagtcgat agtggctcca agtagcgaag cgagcaggac tgggcggcgg 3360 
ccaaagcggt cggacagtgc tccgagaacg ggtgcgcata gaaattgcat caacgcatat 3420 
agcgctagca gcacgccata gtgactggcg atgctgtcgg aatggacgat atcccgcaag 3480 
aggcccggca gtaccggcat aaccaagcct atgcctacag catccagggt gacggtgccg 3540 
agaatgacga tgagcgcatt gttagatttc atacacggtg cctgactgcg ttagcaattt 3600 
aactgtgata aactaccgca ttaaagctca tgcggatcag tgagggtttg caactgcggg 3660 
tcaaggatct ggatttcgat cacggcacga tcatcgtgcg ggagggcaag ggctccaagg 3720 
atcgggcctt gatgttaccc gagagcttgg cacccagcct gcgcgagcag gggaattgat 3730 
ccggtggatg accttttgaa tgacctttaa tagattatat tactaattaa ttggggaccc 3840 
tagaggtccc cttttttatt ttaaaaattt tttcacaaaa cggtttacaa gcataaagct 3900 
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tggcactggc cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta 3960 
atcgccttgc agcacatccc cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg 4020 
atcgcccttc ccaaoagttg cgcagcctga atggcgaatg gcgcctgatg cggtattttc 4030 
tccttacgca tctgtgcggt atttcacacc gcatatggtg cactctcagt acaatctgct 4140 
ctgatgccgc atagttaagc cagccccgac acccgccaac acccgctgac gcgccctgac 4200 
gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca 4260 
tgtgtcagag gttttcaccg tcatcaocga aacgcgcgag acgaaagggc ctcgtgatac 4320 
gcctattttt ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt 4380 
ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt 4440 
atccgctoat gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta 4500 
tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg 4560 
tttttgctca cccagaaacg otggtgaaag taaaagatgo tgaagatcag ttgggtgcac 4620 
gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg 4630 
aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatcco 4740 
gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg 4300 
ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat 4860 
acagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg 4520 
gaggaccgaa ggagctaacc gottttttgc acaacatgg, ggatcatgta actcgccttg 4980 
atcgttggga aocggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc 5040 
ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 5100 
ccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 5160 
cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc 5220 
gcgqtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 5280 
cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 5340 
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cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 5400 
taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 5460 
ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca 5520 
aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac 5580 
caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg 5640 
taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag 5700 
gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac 5760 
cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt 5820 
taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 5830 
agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc 5940 
ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc 6000 
gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc 6060 
acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa 6120 
acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt 6180 
tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg 6240 
ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag 6300 



6301 



•:210> 7 
•:211> 5241 

dua 

<213> Artificial Sequence 



■-:'::3^ Description of Artificial Sequence: Plasmid 

<i.20 

•-:221~- gene 

<222 ■ (1312) . . (2042) 
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<223> deoD 



aJcgatgcat aatgtgcctg tcaaatggac gaagcaggga ttctgcaaac cctatgctac 60 
tccgtcaagc cgtcaattgt ctgattcgtt accaattatg acaacttgac ggctacatca 120 
ttcacttttt cttcacaacc ggcacggaac tcgctcgggc tggccccggt gcatttttta 180 
aatacccgcg agaaatagag ttgatcgtca aaaccaacat tgcgaccgac ggtggcgata 240 
ggcatccggg tggtgctcaa aagcagcttc gcctggctga tacgttggtc ctcgcgccag 300 
cttaagacgc taatccctaa ctgctggcgg aaaagatgtg acagacgcga cggcgacaag 360 
caaacatgct gtgcgacgct ggcgatatca aaattgctgt ctgccaggtg atcgctgatg 420 
tactgacaag cctcgcgtac ccgattatcc atcggtggat ggagcgactc gttaatcgct 430 
tccatgcgcc gcagtaacaa ttgctcaagc agatttatcg ccagcagctc cgaatagcgc 540 
ccttcccctt gcccggcgtt aatgatttgc ccaaacaggt cgctgaaatg cggctggtgc 600 
gcttcatccg ggcgaaagaa ccccgtattg gcaaatattg acggccagtt aagccattca 660 
tgccagtagg cgcgcggacg aaagtaaacc cactggtgat accattcgcg agcctccgga 720 
tgacgaccgt agtgatgaat ctctcctggc gggaacagca aaatatcacc cggtcggcaa 780 
acaaattctc gtccctgatt tttcaccacc ccctgaccgc gaatggtgag attgagaata 840 
taacctttca ttcccagcgg tcggtcgata aaaaaatcga gataaccgtt ggcctcaatc 900 
ggcgttaaac ccgccaccag atgggcatta aacgagtatc ccggcagcag gggatcattt 960 
tgcgcttcag ccatactttt catactcccg ccattcagag aagaaaccaa ttgtccatat 1020 
tgcatcagac attgccgtca ctgcgtcttt tactggctct tctcgctaac caaaccggta 1080 
accccgcrta ttaaaagcat tctgtaacaa agcgggacca aagccatgac aaaaacgcgt 1140 
aacaaaagtg tctataatca cggcagaaaa gtccacattg attatttgca cggcgtcaca 1200 
ctttgctatg ccatagcatt tttatccata agattagcgg atcctacctg acgcttttta 1260 
tcgcaactct ctactgtttc tccatacccg tttttttggg ctagcaggag ggaattcttc 1320 
catggctacc xacacatta atgcagaaat gggcgatttc gctgacgtag ttttgatgcc 1330 
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aggcgacccg 
gaacaacgtt 
aatgggtcac 
tttcggcgtg 
actgcgcgac 
ttttaaagac 
tgcagctaaa 
ctactctccg 
aatggaagcg 
ctgcaccgta 
taccttcaac 
agtcgacctg 
gatacagatt 
tagcgcggtg 
tggtagtgtg 
aggctcagtc 
tgagtaggac 
ggcgggcagg 
cggatggcct 
atgtatccgc 
agtatgagta 
cctgtttttg 
gcacgagtgg 
cccgaagaac 



ctgcgtgcga 
cgcggtatgc 
ggtatgggta 
aagaaaatta 
gtcgttatcg 
catgactttg 
gcactgggta 
gacggcgaaa 
gctggtatct 
tctgaccaca 
gacatgatca 
caggcatgca 
aaatcagaac 
gtcccacctg 
gggtctcccc 
gaaagactgg 
aaatccgccg 
acgcccgcca 
ttttgcgttt 
tcatgagaca 
ttcaacattt 
ctcacccaga 
gttacatcga 
gttttccaat 



Norphll . app 
agtatattgc tgaaactttc 
tgggcttcac cggtacttac 
tcccgtcctg ctccatctac 
tccgcgtggg ttcctgtggc 
gtatgggtgc ctgcaccgat 
ccgctatcgc tgacttcgac 
ttgatgctcg cgtgggtaac 
tgttcgacgt gatggaaaaa 
acggcgtcgc tgcagaattt 
tccgcactca cgagcagacc 
aaatcgcact ggaatccgtt 
agcttggctg ttttggcgga 
gcagaagcgg tctgataaaa 
accccatgcc gaactcagaa 
atgcgagagt agggaactgc 
gcctttcgtt ttatctgttg 
ggagcggatt tgaacgttgc 
taaactgcca ggcatcaaat 
ctacaaactc ttttgtttat 
ataaccctga taaatgcttc 
ccgtgtcgcc cttattccct 
aacgctggtg aaagtaaaag 
actggatc^c aacagcggta 



gatgagcact tttaaagttc 
Page 2 5 



cttgaagatg 
aaaggccgca 
accaaagaac 
gcagttctgc 
tccaaagtta 
atggtgcgta 
ctgttctccg 
tacggcattc 
ggcgcgaaag 
actgccgctg 
ctgctgggcg 
tgagagaaga 
cagaatttgc 
gtgaaacgcc 
caggcatcaa 
tttgtcggtg 
gaagcaacgg 
taagcagaag 
ttttctaaat 
aataatattg 
tttttgcggc 
atgctgaaga 
agatccttga 
tgctatgtgg 



cccgtgaagt 
aaatttccgt 
tgatcaccga 
cgcacgtaaa 
accgcatccg 
acgcagtaga 
ctgacctgtt 
tcggcgtgga 
ccctgaccat 
agcgtcagac 
ataaagagta 
ttttcagcct 
ctggcggcag 
gtagcgccga 
ataaaacgaa 
aacgctctcc 
cccggagggt 
gccatcctga 
acattcaaat 
aaaaaggaag 
attttgcctt 
tcagttgggt 
gagttttcgc 

CQC 

ggtatta 



1440 
1500 
15 60 
162 0 
1630 
1740 
1800 
IS 60 
1920 
198 0 
2 04 0 
2100 
2160 
2220 
228 0 



2340 
2400 
24 60 
2520 
2580 
2 64 0 
2700 
27 60 
2820 



Norphll . app 

tccgtgttg acgccgggca agagcaactc ggtcgccgc. tacactattc tcagaatgac 2880 
ttggttgagt aotoaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa 2940 
ttatgcagt, ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg 3000 
atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc 3060 
cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg 3120 
atacctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta 3180 
gcttcccggo aacaattaat agactggatg gaggcggata aagttgoagg accacttctg 3240 
cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg 3300 
tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc 3360 
tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt 3420 
gcotcactga ttaagcattg gtaactgtca gaccaagtr.t actcatatat actttagatt 3480 
gatttacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt 3540 
gaccgctaoa cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct 3600 
cgccacgttc gccggctttc cccgtcaagc tctaaatcgg gggctccctt tagggttccg 3660 
atttagtgct ttacggcacc tcgaccccaa aaaacttgat ttgggtgatg gttcacgtag 3720 
taogccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa 3780 
tagtggactc ttgttccaaa cttgaacaac actcaaccct atctcgggct attcttttga 3840 
tttataaggg attttgccga tttcggccta ttggttaaaa aatgagctga tttaacaaaa 3900 
atttaacgcg aattttaaca aaatattaac gtttacaatt taaaaggatc taggtgaaga 3960 
tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 4020 
,a,acc=c,t agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatc, 4080 
gctgcttgca aacaaaaaaa ccaccgctac =,gc,gt,gt ttgtttgccg gatcaagagc 4140 
taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtc, 4200 
ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 4260 
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tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 4320 
ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 4380 
cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 4440 
agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 4500 
gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 4560 
atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 4620 
gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 4630 
gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 4740 
ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 4800 
cagtgagcga ggaagcggaa gagcgcctga tgcggtattt tctccttacg catctgtgcg 4860 
gtatttcaca ccgcataggg tcatggctgc gccccgacac ccgccaacac ccgctgacgc 4920 
gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg 4980 
gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgaggc agcaaggaga 5040 
tggcgcccaa cagtcccccg gccacggggc ctgccaccat acccacgccg aaacaagcgc 5100 
tcatgagccc gaagtggcga gcccgatctt ccccatcggt gatgtcggcg atataggcgc 5160 
cagcaaccgc acctgtggcg ccggtgatgc cggccacgat gcgtccggcg tagaggatct 5220 

5241 

qctcatgttt gacagcttat c 



-210:- 8 
-.21 i> 5822 
-212> DHA 

-213 • Artificial Sequence 

'223-- Description of Artificial Sequence: pGM716 with 
deletion of Hpal fragment 

" :400 > 8 nt ^^ r ^^ a-attqqccg attcattaat gcagctggca 60 

gcgcccaata cgcaaaccgc Ui-l'^xv^ .^-jLLyj^y 
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cgacaggttt gccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct 120 
cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat 180 
tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg aattcgagct 240 
cggtaccatc catgtccaag tctgatgttt ttcatctcgg cctcactaaa aacgatttac 300 
aaggggctac gcttgccatc gtccctggcg acccggatcg tgtggaaaag atcgccgcgc 360 
tgatggataa gccggttaag ctggcatctc accgcgaatt cactacctgg cgtgcagago 420 
tggatggtaa acctgttatc gtctgctcta ccggtatcgg cggcccgtct acctctattg 430 
ctgttgaaga gctggcacag ctgggcattc gcaccttcct gcgtatcggt acaacgggcg 540 
ctattcagcc gcatattaat gtgggtgatg tcctggttac cacggcgtct gtccgtctgg 600 
atggcgcgag cctgcacttc gcacogctgg aattcccggc tgtcgctgat ttcgaatgta 660 
cgactgcgct ggttgaagct gcgaaatcga ttggcgcgac aactcacgtt ggcgtgacag 720 
cttcttctga taccttctac ccaggtcagg aacgttacga tacttactct ggtcgcgtag 780 
ttcgtcactt taaaggttct atggaagagt ggcaggcgat gggcgtaatg aactatgaaa 840 
tggaatctgc aaccctgctg accatgtgtg caagtcaggg cctgcgtgcc ggtatggtag 900 
cgggtgttat cgttaaccgc atccgtttta aagaocatga ctttgccgct atcgctgact 960 
tcgacatggt gcgtaacgca gtagatgcag ctaaagcact gggtattgat gctogcgtgg 1020 
gtaacctgtt ctccgctgac ctgttctact ctccggacgg cgaaatgttc gacgtgatgg 1080 
aaaaatacgg cattctcggc gtggaaatgg aagcggctgg tatctacggc gtcgctgcag 1140 
aatttggcgc gaaagccctg accatctgca ccgtatctga ccacatccgc actcacgagc 1200 
agaccactgc egctgagcgt cagactacct tcaacgacat gatcaaaatc gcactggaat 1260 
ccgttctgct gggcgataaa gagtaagtcg acctgcaggc atgcaagctt tatgcttgta 1320 
aaccgttttg tgaaaaaatt tttaaaataa aaaaggggac ctctagggtc cccaattaat 1380 
tagtaatata atctattaaa ggtcattcaa aaggtcatcc accggatcag ottagtaaag 1440 
ccctcgctag attttaatgc ggatgttgcg attacttcgc caactattgc gataacaaga 1500 
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aaaagccagc ctttcatgat atatctccca atttgtgtag ggcttattat gcacgcttaa 1560 
aaataataaa agcagacttg acctgatagt ttggctgtga gcaattatgt gcttagtgca 1620 
tctaacgctt gagttaagcc gcgccgcgaa gcggcgtcgg cttgaacgaa ttgttagaca 1G30 
ttatttgccg actaccttgg tgatctcgcc tttcacgtag tggacaaatt cttccaactg 1740 
atctgcgcgc cgagatgcgc cgcgtgcggc tgctggagat ggcggacgcg atggatatgt 1800 
tctgccaagg gttggtttgc gcattcacag ttctccgcaa gaattgattg gctccaattc 1860 
ttggagtggt gaatccgtta gcgaggtgcc gccggcttcc attcaggtcg aggtggcccg 1920 
gctccatgca ccgcgacgca acgcggggag gcagacaagg tatagggcgg cgcctacaat 1980 
ccatgccaac ccgttccatg tgctcgccga ggcggcataa atcgccgtga cgatcagcgg 2040 
tccagtgatc gaagttaggc tggtaagagc cgcgagcgat ccttgaagct gtccctgatg 2100 
gtcgtcatct acctgcctgg acagcatggc ctgcaacgcg ggcatcccga tgccgccgga 2160 
agcgagaaga atcataatgg ggaaggccat ccagcctcgc gtcgcgaacg ccagcaagac 2220 
gtagcccagc gcgtcggccg ccatgccggc gataatggcc tgcttctcgc cgaaacgttt 2280 
ggtggcggga ccagtgacga aggcttgagc gagggcgtgc aagattccga ataccgcaag 2240 
cgacaggccg atcatcgtcg cgctccagcg aaagcggtcc tcgccgaaaa tgacccagag 2400 
cgctgccggc acctgtccta cgagttgcat gataaagaag acagtcataa gtgcggcgac 2460 
gatagtcatg ccccgcgccc accggaagga gctgactggg ttgaaggctc tcaagggcat 2520 
cggtcgacgc tctcccttat gcgactcctg cattaggaag cagcccagta gtaggttgag 2580 
gccgttgagc accgccgccg caaggaatgg tgcatgcaag gagatggcgc ccaacagtcc 2 64 0 
cccggccacg gggcctgcca ccatacccac gccgaaacaa gcgctcatga gcccgaagtg 2700 
gcgagcccga tcttccccat cggtgatgtc ggcgatatag gcgccagcaa ccgcacctgt 27 60 
ggcgccggtg atgccggcca cgatgcgtcc ggcgtagagg atccacagga cgggtgtggt 2820 
cgccatgatc gcgtagtcga tagtggctcc aagtagcgaa gcgagcagga ctgggcggcg 2880 
gccaaagcg, tcggacagtg ctccgagaac gggtg=gcat agaaattgca tcaacgcata 2940 
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tagcgctagc agcacgccat agtgactggc gatgctgtcg gaatggacga tatcccgcaa 3000 
gaggcccggc agtaccggca taaccaagcc tatgcctaca gcatccaggg tgacggtgcc 3060 
gaggatgacg atgagcgcat tgttagattt catacacggt gcctgactgc gttagcaatt 3120 
taactgtgat aaactaccgc attaaagctc atgcggatca gtgagggttt gcaactgcgg 3180 
gtcaaggatc tggatttcga tcacggcacg atcatcgtgc gggagggcaa gggctccaag 3240 
gatcgggcct tgatgttacc cgagagcttg gcacccagcc tgcgcgagca ggggaattga 3300 
tccggtggat gaccttttga atgaccttta atagattata ttactaatta attggggacc 3360 
ctagaggtcc ccttttttat tttaaaaatt ttttcacaaa acggtttaca agcataaagc 3420 
ttggcactgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 3480 
aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 3540 
gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgcctgat gcggtatttt 3600 
ctccttacgc atctgtgcgg tatttcacac cgcatatggt gcactctcag tacaatctgc 3660 
tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga cgcgccctga 3720 
cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc cgggagctgc 3730 
atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg cctcgtgata 3840 
cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact 3900 
tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg 3960 
tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt 4020 
atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct 4080 
gtttttgct c acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca 4140 
cgagtgggtt aca.cgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc 4200 
gaagaacgtt ttccaatgat gagcactttt aaag.tctgc tatgtggcgc ggtattatcc 4260 
cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg 4320 
gttgagtact caccagtcac agaaaagcat ctta.ggatg gcatgacagt aagagaatta 4380 
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tgcagtg.tg ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc 4440 
ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt 4500 
gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg 4560 
cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct 46=0 
tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc 4630 
tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct 4740 
cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac 4800 
acgaccgoga gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc 4860 
tcactgatta agcattggta actgtcagac caagtttact catatatact ttagattgat 4920 
ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg 4 980 
accaaaatcc cttaaogtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc 5040 
aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa 5100 
ccacc 3 =«c cagcggtggt ttgcttgccg gatcaagago taccaactct ttctccgaag 5160 
ataactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta 5220 
ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta 5230 
ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagaogatag 5340 
ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg 5400 
gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga aagcgccacg 5460 
cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag 5520 
cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc 5580 
caactctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa 5640 
aacaccagca acgcggcctt tttacggttc otggcctttt gctggccttt tgctcacatg 5100 
ttctttccts c ,ttat«c= tgattctgtg 3 ataac=g,a ttaccgcctt t 3 agtgagct 5760 
gata:=gctc gccgcagccg aacgaccgag cocagcgagt cagtgagcga g 3 aagcggaa 5620 
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<210> 9 
<211> 6269 
012> DUA 

<213> Artificial Sequence 



*:-°> Description of Artificial Sequence: udp and deoD 
cloned in pUC18 so to create a fusion between the 
two proteins 

gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 60 
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct 120 
cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat 180 
tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg aattcgagct 240 
cggtaccatc catgtccaag tctgatgttt ttcatctcgg cctcactaaa aacgatttac 300 
aaggggctac gcttgccatc gtccctggcg acccggatcg tgtggaaaag atcgccgcgc 360 
tgatggataa gccggttaag ctggcatctc accgcgaatt cactacctgg cgtgcagagc 420 
tggatggtaa acctgttatc gtctgctcta ccggtatcgg cggcccgtct acctctattg 430 
ctgttgaaga gctggcacag ctgggcattc gcaccttcct gcgtatcggt acaacgggcg 540 
ctattcagcc gcatattaat gtgggtgatg tcctggttac cacggcgtct gtccgtctgg 600 
atggcgcgag cctgcacttc gcaccgctgg aattcccggc tgtcgctgat ttcgaatgta 660 
cgactgcgct ggttgaagct gcgaaatcca ttggcgcgac aactcacgtt ggcgtgacag 720 
cttctrctga taccttctac ccaggtcagg aacgttacga tacttactct ggtcgcgtag 780 
c:;cgtcactt taaaggttct atggaagagt ggcaggcgat gggcgtaatg aactatgaaa 840 
tagaatctgc aaccctgctg accatgtgtg caagtcaggg cctgcgtgcc ggtatggtag 900 
cgggtgttat cgttaaccgc acccagcaag agatcccgaa tgctgagacg atgaaacaaa 960 
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ccgaaagcca tgcggtgaaa atcgtggtgg aagcggcgcg tcgtctgctg tccatggcta 10,0 
ccccacacat taatgcagaa atgggcgatt tcgctgacgt agttttgatg ccaggcgacc 1080 
cgctgcgtgc gaagtatatt gctgaaactt tccttgaaga tgcccgtgaa gtgaacaacg 1140 
ttcgcggtat gctgggcttc accggtactt acaaaggccg caaaatttcc gtaatgggtc 1200 
acggtatggg tatcccgtcc tgctccatct acaccaaaga actgatcacc gatttcggcg 1260 
tgaagaaaat tatccgcgtg ggttcctgtg gcgcagttct gccgcacgta aaactgcgcg 1320 
acgtcgttat cggtatgggt gcctgcaccg attccaaagt taaccgcatc cgttttaaag 1380 
accatgactt tgccgctatc gctgacttcg acatggtgcg taacgcagta gatgcagcta 1440 
aagcactggg tattgatgct cgcgtgggta acctgttctc cgctgacctg ttctactctc 1500 
cggacggcga aatgttcgac gtgatggaaa aatacggcat tctcggcgtg gaaatggaag 1560 
cggctggtat ctacggcgtc gctgcagaat ttggcgcgaa agccctgacc atctgcaccg 1620 
tatctgacca catccgcact cacgagcaga ccactgccgc tgagcgtcag actaccttca 1630 
acgacatgat caaaatcgca ctggaatccg ttctgctggg cgataaagag taagtcgacc 1740 
tgcaggcatg caagctttat gcttgtaaac cgttttgtga aaaaattttt aaaataaaaa 1300 
aggggacctc tagggtcccc aattaattag taatataatc tattaaaggt cattcaaaag I860 
gtcatccacc ggatcagctt agtaaagccc tcgctagatt ttaatgcgga tgttgcgatt 1920 
acttcgccaa ctattgcgat aacaagaaaa agccagcctt tcatgatata tctcccaatt 1980 
tgtgtagggc ttattatgca cgcttaaaaa taataaaagc agacttgacc tgatagtttg 2040 
gctgtgagca attatgtgct tagtgcatct aacgcttgag ttaagccgcg ccgcgaagcg 2100 
gcgtcggctt gaacgaattg ttagacatta tttgccgact accttggtga tctcgccttt 2160 
cacgtagtgg acaaattctt ccaactgatc tgcgcgccga gatgcgccgc gtgcggctgc 2220 
tggagatggc ggacgcgatg gatatgttct gccaagggtt ggtttgcgca ttcacagttc 2280 
tccgcaagaa ttgattggct ccaattcttg gagtggtgaa tccg.tagcg aggtgccgcc 2340 
ggcttccatt caggtcgagg tggcccggct ccatgcaccg cgacgcaacg cggggaggca 2400 
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gacaaggtat agggcggcgc ctacaatcca tgccaacccg ttccatgtgc tcgccgaggc .460 

ggcataaatc gccgtgacga tcagcggtcc agtgatcgaa gttaggctgg taagagccgc 2520 
gagcgatcct tgaagctgtc cctgatggtc gtcatctacc tgcctggaca gcatggcctg 2580 
caacgcgggc atcccgatgc cgccggaagc gagaagaatc ataatgggga aggccatcca 2640 
gcctcgcgtc gcgaacgcca gcaagacgta gcccagcgcg tcggccgcca tgccggcgat 2700 
aatggcctgc ttctcgccga aacgtttggt ggcgggacca gtgacgaagg cttgagcgag 27 60 
ggcgtgcaag attccgaata ccgcaagcga caggccgatc atcgtcgcgc tccagcgaaa 2820 
gcggtcctcg ccgaaaatga cccagagcgc tgccggcacc tgtcctacga gttgcatgat 2880 
aaagaagaca gtcataagtg cggcgacgat agtcatgccc cgcgcccacc ggaaggagct 2940 
gactgggttg aaggctctca agggcatcgg tcgacgctct cccttatgcg actcctgcat 3000 
taggaagcag cccagtagta ggttgaggcc gttgagcacc gccgccgcaa ggaatggtgc 3060 
atgcaaggag atggcgccca acagtccccc ggccacgggg cctgccacca tacccacgcc 3120 
gaaacaagcg ctcatgagcc cgaagtggcg agcccgatct tccccatcgg tgatgtcggc 3180 
gatataggcg ccagcaaccg cacctgtggc gccggtgatg ccggccacga tgcgtccggc 3240 
gtagaggatc cacaggacgg gtgtggtcgc catgatcgcg tagtcgatag tggctccaag 3300 
tagcgaagcg agcaggactg ggcggcggcc aaagcggtcg gacagtgctc cgagaacggg 3360 
tgcgcataga aattgcatca acgcatatag cgctagcagc acgccatagt gactggcgat 3420 
gctgtcggaa tggacgatat cccgcaagag gcccggcagt accggcataa ccaagcctat 3480 
gcctacagca tccagggtga cggtgccgag gatgacgatg agcgcattgt tagatttcat 3540 
acacggtgcc tgactgcgtt agcaatttaa ctgtgataaa ctaccgcatt aaagctcatg 3600 
cggatcagtg agggtttgca actgcgggtc aaggatctgg atttcgatca cggcacgatc 3660 
atcgtgcggg agggcaaggg ctccaaggat cgggccttga tgttacccga gagcttggca 3720 
cccagcctgc gcgagcaggg gaattgatcc ggtggatgac cttttgaatg acctttaata 3780 
gattatatta ctaattaatt ggggacccta gaggtcccct tttttatttt aaaaattttt 3840 
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tcacaaaacg gtttacaagc ataaagcttg gcactggcc, tcgttttaca acgtcgtgac 3900 
tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatocccc tttcgccagc 3960 
tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat 4020 
ggcgaatggc gcctgatgc, gtattttctc cttacgcatc tgtgcggtat ttcacaocgc 4030 
atatggtgca ctctcagtac aatctgctct gatgccgcat agttaagcca gccccgacac 4!40 
ccgccaacac ccgctgacgc gccctgacgg gottgtctgc tcccggcatc cgcttacaga 4200 
caagctgtga ccgtctccgg gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa 4260 
cgcgcgagac gaaagggcct cgtgatacgc ctatttttat aggttaatgt catgataata 4320 
atggtttctt agacgtcagg tggcactttt cggggaaatg tgcgoggaac ccctatttgt 4380 
ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg 4440 
cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt 4500 
cccttttttg cggcattttg ccttoctgtt tttgctcacc cagaaacgct ggtgaaagta 4560 
aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcoaactgga tctcaacagc 4620 
ggtaagatcc ttgagag.tt tcgccccgaa gaacgt.ttc caatgatgag cacttttaaa 4630 
gttctgctat gtggcgcggt attatcccgt attgaogccg ggcaagagca actcggtcgc 4740 
cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt 4300 
acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact 4860 
gcggccaact tacttctgac aacgatcgga ggaccgaagg agotaaccgc ttttttgcac 4920 
aacatggggg atcatgtaac tcgccttgat cgttgggaao cggagctgaa tgaagccata 4 980 
ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta 5040 
ttaactggog aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg 5100 
gataaagttg caggaccact tctgcgctcg gccct.ccgg ctggctggtt -.attgctgat 5160 
aaatctggag ccggtgagcg tgggtccgc ggtatcattg cagcactggg gccagatggt 5220 
aagccctccc jtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga 5230 
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aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa 5340 
gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag 5400 
gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac 5460 
tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc 5520 
gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat 5580 
caagagctac caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat 5640 
actgtccttc tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct 5700 
acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt 5760 
cr.taccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg 5820 
gggggttogt gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta 5880 
cagcgcgagc tatgagaaag cgccacgctt occgaaggga gaaaggogga caggtatccg 5940 
g,aagcggca gggtoggaac aggagagcgc acgagggagc ttccaggggg aaacgcotgg 6000 
tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc 6060 
tcgteagggg ggcggagcct atggaaaaac gocagcaacg cggccttttt acggttcctg 6120 
gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat 6180 
aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc 6240 
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aqcgagtcag tgagcgagga agcggaaga 
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Description of Artificial Sequence: udp and deoD 
clone/in pUC18 so to create a fusion between the 
two proteins bonded to each other via an aa linker 



gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 60 
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cgacaggttt ccegactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct 120 
cactcattag gcaocccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat 180 
tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg aattcgagct 240 
cggtaccatc catgtccaag tctgatgttt ttcatctcgg cctcactaaa aacgatttac 300 
aaggggctac gcttgccatc gtccctggcg acccggatcg tgtggaaaag atcgccgcgc 360 
tgatggataa gccggttaag ctggcatctc accgcgaatt cactacctgg cgtgcagagc 420 
tggatggtaa acctgttatc gtctgctcta ccggtatcgg cggcccgtct aoctctattg 480 
ctcttcaaga gctggcacag ctgggcattc gcaccttcct gcgtatcggt acaacgggcg 540 
ctattcagcc gcatattaat gtgggtgatg tcctggttac cacggogtct gtccgtctgg 600 
atggcgcgag cctgcacttc gcaccgctgg aa.tcccggc tgtcgctgat ttcgaatgta 660 
cgactgcgct ggttgaagct gcgaaatcca ttggcgcgac aactcacgtt ggcgtgacag 120 
cttcttctga taccttctac ccaggtcagg aacgttacga tacttactct ggtcgcgtag 180 
ttcgt,ac,t taaaggtt.t atggaagagt ggcaggcgat gggcgtaatg aactatgaaa 840 
tggaatctgc aaccctgctg accatgtgtg caagtcaggg cctgcgtgcc ggtatggtag 900 
cgggtgttat cgttaaccgc acccagcaag agatcccgaa tgctgagacg atgaaacaaa .60 
ccgaaagcca tgcggtgaaa atcgtggtgg aagcggcgcg tcgtctgctg tccatgggcg 1020 
gtggcagccc gggcattctg gccatggcta ccccacacat taatgcagaa atgggcgatt 1080 
tcgctgacgt agttttgatg ccaggcgacc cgctgcgtgc gaagtatatt gctgaaactt 1140 
tccttgaaga tgcccgtgaa gtgaacaacg ttcgcggtat gctgggcttc accggtactt 1200 
acaaaggccg caaaatttcc gtaatgggtc acggtatggg tatcccgtcc tgctccatct 1260 
acaccaaaga actgatcacc gatttcggcg tgaagaaaat tatccgcgtg ggttcctgtg 1320 
gcgcagttct gocgcacgta aaactgcgcg acgtcgttat cggtatgggt gcctgcaccg 1380 
a,tccaaagt taaccgcatc cgttttaaag accatgactt tgccgctatc gctgacttcg 1440 
acatggtgcg taacgcagta gatgcagcta aagcactggg tattgatgct cgcgtgggta 1500 
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acctgttctc cgctgacctg ttctactctc cggacggcga aatgttcgac gtgatggaaa 1560 
aatacggcat tctcggcgtg gaaatggaag cggctggtat ctacggcgtc gctgcagaat 1620 
ttggcgcgaa agccctgacc atctgcaccg tatctgacca catccgcact cacgagcaga 1630 
ccactgccgc tgagcgtcag actaccttca acgacatgat caaaatcgca ctggaatccg 1740 
ttctgctggg cgataaagag taagtcgacc tgcaggcatg caagctttat gcttgtaaac 1800 
cgttttgtga aaaaattttt aaaataaaaa aggggacctc tagggtcccc aattaattag 1860 
taatataatc tattaaaggt cattcaaaag gtcatccacc ggatcagctt agtaaagccc 1920 
tcgctagatt ttaatgcgga tgttgcgatt acttcgccaa ctattgcgat aacaagaaaa 1980 
agccagcctt tcatgatata tctcccaatt tgtgtagggc ttattatgca cgcttaaaaa 2040 
taataaaagc agacttgacc tgatagtttg gctgtgagca attatgtgct tagtgcatct 2100 
aacgcttgag ttaagccgcg ccgcgaagcg gcgtcggctt gaacgaattg ttagacatta 2160 
tttgccgact accttggtga tctcgccttt cacgtagtgg acaaattctt ccaactgatc 2220 
tgcgcgccga gatgcgccgc gtgcggctgc tggagatggc ggacgcgatg gatatgttct 2280 
gccaagggtt ggtttgcgca ttcacagttc tccgcaagaa ttgattggct ccaattcttg 2340 
gagtggtgaa tccgttagcg aggtgccgcc ggcttccatt caggtcgagg tggcccggct 2400 
ccatgcaccg cgacgcaacg cggggaggca gacaaggtat agggcggcgc ctacaatcca 2460 
tgccaacccg ttccatgtgc tcgccgaggc ggcataaatc gccgtgacga tcagcggtcc 2520 
agtgatcgaa gttaggctgg taagagccgc gagcgatcct tgaagctgtc cctgatggtc 2530 
gtcatctacc tgcctggaca gcatggcctg caacgcgggc atcccgatgc cgccggaagc 2 640 
gagaagaatc ataatgggga aggccatcca gcctcgcgtc gcgaacgcca gcaagacgta 2700 
gcccagcgcg tcggccgcca tgccggcgat aatggcctgc ttctcgccga aacgtttggt 2760 
ggcgggacca gtgacgaagg cttgagcgag ggcgtgcaag attccgaata ccgcaagcga 2820 
caggccgatc atcgtcgcgc tccagcgaaa gcggtcctcg ccgaaaatga cccagagcgc 2830 
tgccgg=acc tgtcctacga gttgcatgat aaagaagaca gtcataagtg cggcgacgat 2940 
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agtcatgccc cgcgcccacc ggaaggagct gactgggttg aaggctctca agggcatcgg 3000 
tcgacgctct cccttatgcg actcctgcat taggaagcag cccagtagta ggttgaggcc 3060 
gttgagcacc gccgccgcaa ggaatggtgc atgcaaggag atggcgccca acagtccccc 3120 
ggccacgggg cctgccacca tacccacgcc gaaacaagcg ctcatgagcc cgaagtggcg 3180 
agcccgatct tccccatcgg tgatgtcggc gatataggcg ccagcaaccg cacctgtggc 3240 
gccggtgatg ccggccacga tgcgtccggc gtagaggatc cacaggacgg gtgtggtcgc 3300 
catgatcgcg tagtcgatag tggctccaag tagcgaagcg agcaggactg ggcggcggcc 3360 
aaagcggtcg gacagtgctc cgagaacggg tgcgcataga aattgcatca acgcatatag 3420 
cgctagcagc acgccatagt gactggcgat gctgtcggaa tggacgatat cccgcaagag 3430 
gcccggcagt accggcataa ccaagcctat gcctacagca tccagggtga cggtgccgag 3540 
gatgacgatg agcgcattgt tagatttcat acacggtgcc tgactgcgtt agcaatttaa 3600 
ctgtgataaa ctaccgcatt aaagctcatg cggatcagtg agggtttgca actgcgggtc 3660 
aagaatctgg atttcgatca cggcacgatc atcgtgcggg agggcaaggg ctccaaggat 3720 
cgggccttga tgttacccga gagcttggca cccagcctgc gcgagcaggg gaattgatcc 3780 
ggtggatgac cttttgaatg acctttaata gattatatta ctaattaatt ggggacccta 3840 
gaggtcccct tttttatttt aaaaattttt tcacaaaacg gtttacaagc ataaagcttg 3900 
gcactggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat 3960 
cgccttgcag cacatccccc tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat 4020 
cgcccrtccc aacagttgcg cagcctgaat ggcgaatggc gcctgatgcg gtattttctc 4080 
cttacgcatc tgtgcggtat ttcacaccgc atatggtgca crctcagtac aatctgctct 4140 
gatgccgcat agttaagcca gccccgacac ccgccaacac ccgctgacgc gccctgacgg 4200 
gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg gagctgcatg 4260 
tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgagac gaaagggcct cgtgatacgc 4320 
ctatttttat aggttaatgt catgataata atggtttctt agacgtcagg tggcactttt 4330 
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cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat 4440 
ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg 4500 
agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt 4560 
tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga 4620 
gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa 4630 
gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt 4740 
attgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt 4800 
gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc 4860 
agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga 4920 
ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat 4980 
cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct 5040 
gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc 5100 
cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg 5160 
gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc 5220 
ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg 5230 
acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca 5340 
ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta gattgattta 5400 
aaacttcatt tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc 5460 
aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa 5520 
ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca 5580 
ccgctaccag cggtggtttg rttgccggat caagagctac caactctttt tccgaaggta 5640 
actggc-.tca gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc 5700 
caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca 57 60 
gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta 5820 
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ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag 5880 
cgaacgacct acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt 59.0 
cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc 6000 
acgagggagc ttccagggg, aaacgcctgg tatettt.t. gtcctgtcg, gtttcgccac 6060 
ctctgacttg agcgtcgatt tttgtgatgo tcgtcagggg ggcggagcct atggaaaaac 6120 
gccagoaacg cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc 6130 
tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat 6240 
accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaaga 6299 



<210> ii 
<211> 2297 
<212> DtJA 

<213> Artificial Sequence 

023> Description of Artificial Sequence: cloning vector 

derived from pUC18 

gggcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagaattcg 60 
agctcggtac ccggggatcc tctagagtcg acctgcaggc atgcaagctt atggtgcact 120 
ctcagtacaa tctgctctga tgccgcatag ttaagccagc cccgacaccc gccaacaccc 180 
gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg cttacagaca agctgtgacc 2,0 
gtctccggga gctgcatgtg tcagaggttt tcaccgtcat caocgaaacg cgcgagacga 300 
aagggcctcg tgatacgcct atttttatag gttaatgtca tgataataat ggtttcttag 360 
acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa 420 
atacattcaa atatgt.tcc gctcatgaga caataaccct gataaatgct tcaataatat 480 
tgaaaaagga agagtatgag t.ttcaac.t ttccgtgtcg cccttattcc cttttttgc, 540 
gcattttgcc ttcctgtttt tgctcaccca gaaa=,ctgg tgaaagtaaa agatgctgaa 600 
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gatcagttgg gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt 660 
gagagttttc gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt 720 
ggcgcggtat tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat 780 
tctcagaatg acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg 840 
acagtaagag aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta 900 
cttctgacaa cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat 960 
catgtaactc gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag 1020 
cgtgacacca cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa 1080 
ctacttactc tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca 1140 
ggaccacttc tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc 1200 
ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt 1260 
atcgtagtta tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc 1320 
gctgagatag gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat 1380 
atactttaga ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt 1440 
tttgataatc tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac 1500 
cccgtagaaa agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc 1560 
ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca 1620 
actctttttc cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta 1680 
gtgtagccgt agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct 1740 
ctgctaatcc rgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg 1800 
gactcaagac gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc 1860 
acacagccca gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta 1920 
tgagaaagcg ccacg=ttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg 1980 
gtcggaacag gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt 2040 
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cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg 2100 
cggagcctat ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg 2160 
ccttttgctc acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc 2220 
gcctttgagt gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg 2230 

22 97 

agcgaggaag cggaaga 



<210> 12 
<211> 3031 
<212> DNA 

<213> Artificial Sequence 

<- 2 A* Description of Artificial Sequence: udp and deoD 

cloned into pGM746 without upstream ptac promoter 

gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagaattcg 60 
agctcggtac ccggggatcc tagcaggagg gaattcttcc atggctaccc cacacattaa 120 
tgcagaaatg ggcgatttcg ctgacgtagt tttgatgcca ggcgacccgc tgcgtgcgaa 160 
gtatattgct gaaactttcc ttgaagatgc ccgtgaagtg aacaacgttc gcggtatgct 240 
gggcttcacc ggtacttaca aaggccgcaa aatttccgta atgggtcacg gtatgggtat 300 
cccgtcctgc tccatctaca ccaaagaact gatcaccgat ttcggcgtga agaaaattat 360 
ccgcgtgggt tcctgtggcg cagttctgcc gcacgtaaaa ctgcgcgacg tcgttatcgg 420 
tatgggtgcc tgcaccgatt ccaaagttaa ccgcatccgt tttaaagacc atgactttgc 480 
cgctatcgct gacttcgaca tggtgcgtaa cgcagtagat gcagctaaag cactgggtat 540 
tgatgctcgc gtgggtaacc tgttctccgc tgacctgttc tactctccgg acggcgaaat 600 
gttcgacgtg atggaaaaat acggcattct cggcgtggaa atggaagcgg ctggtatcta 660 
cggcgtcgct gcagaatttg gcgcgaaagc cctgaccatc tgcaccgtat ctgaccacat 720 

,,, :a c:cac gagcagacca ctgccgctga gcgtcagact accttcaacg acatgatcaa 780 
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aatcgcactg gaatccgttc tgctgggcga taaagagtaa gtcgacctgc aggcatgcaa 840 
gcttatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc cagccccgac 900 
acccgccaac acccgctgac gcgccctgac gggcttgtct gctcccggca tccgcttaca 960 
gacaagctgt gaccgtctcc gggagctgca tgtgtcagag gttttcaccg tcatcaccga 1020 
aacgcgcgag acgaaagggc ctcgtgatac gcctattttt ataggttaat gtcatgataa 1080 
taatggtttc ttagacgtca ggtggcactt ttcggggaaa tgtgcgcgga acccctattt 1140 
gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa 1200 
tgcttcaata atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta 1260 
ttcccttttt tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag 1320 
taaaagatgc tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca 1380 
gcggtaagat ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta 1440 
aagttctgct atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc 1500 
gccgcataca ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc 1560 
ttacggatgg catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca 1620 
ctgcggccaa cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc 1680 
acaacatggg ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca 1740 
taccaaacga cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac 1800 
tattaactgg cgaactactt actctagctt cccggcaaca attaatagac tggatggagg I860 
cggataaagt tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg 1920 
ataaatctgg agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg 1980 
gtaagccctc ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac 2040 
gaaatagaca gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc 2100 
aagtttactc atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct 2160 
aggtgaagat cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc 2220 
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actgagcgtc agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc 2280 
gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg 2340 
atcaagagct accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa 2400 
atactgtcct tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc 2460 
ctacatacct cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt 2520 
gtcttaccgg gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa 2580 
cggggggttc gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc 2640 
tacagcgtga gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc 2700 
cggtaagcgg cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct 2760 
ggtatcttta tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat 2820 
gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc 2880 
tggccttttg ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg 2940 
ataaccgtat taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc 3000 

^0 31 

gcaqcgagtc agtgagcgag gaagcggaag a 



<210> 13 
<211> 3128 
<212> DNA 

<213> Artificial Sequence 

<220> . , 

<223> Description of Artificial Sequence: deoD cloned 

downstream ptac promoter 

<4 00> 13 . ,,, 

gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagaattcg bO 

agctccgaca tcataacggt tctggcaaat attctgaaat gagctgttga caattaatca 120 

tcggctcgta taatgcgtgg aattgtgagc ggataacaat ttcacacagg aggatcctag 180 

caggagggaa ttcttccatg gctaccccac acattaatgc agaaatgggc gatttcgctg 240 
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acgtagtttt gatgccaggc gacccgctgc gtgcgaagta tattgctgaa actttccttg 300 
aagatgcccg tgaagtgaac aacgttcgcg gtatgctggg cttcaccggt acttacaaag 360 
gccgcaaaat ttccgtaatg ggtcacggta tgggtatccc gtcctgctcc atctacacca 420 
aagaactgat caccgatttc ggcgtgaaga aaattatccg cgtgggttcc tgtggcgcag 480 
ttctgccgca cgtaaaactg cgcgacgtcg ttatcggtat gggtgcctgc accgattcca 540 
aagttaaccg catccgtttt aaagaccatg actttgccgc tatcgctgac ttcgacatgg 600 
tgcgtaacgc agtagatgca gctaaagcac tgggtattga tgctcgcgtg ggtaacctgt 660 
tctccgctga cctgttctac tctccggacg gcgaaatgtt cgacgtgatg gaaaaatacg 720 
gcattctcgg cgtggaaatg gaagcggctg gtatctacgg cgtcgctgca gaatttggcg 780 
cgaaagccct gaccatctgc accgtatctg accacatccg cactcacgag cagaccactg 840 
ccgctgagcg tcagactacc ttcaacgaca tgatcaaaat cgcactggaa tccgttctgc 900 
tgggcgataa agagtaagtc gacctgcagg catgcaagct tatggtgcac tctcagtaca 960 
atctgctctg atgccgcata gttaagccag ccccgacacc cgccaacacc cgctgacgcg 1020 
ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac cgtctccggg 1080 
agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcgagacg aaagggcctc 1140 
gtgatacgcc tatttttata ggttaatgtc atgataataa tggtttctta gacgtcaggt 1200 
ggcacttttc ggggaaatgt gcgcggaacc cctatttgtt tatttttcta aatacattca 1260 
aatatgtatc cgctcatgag acaataaccc tgataaatgc ttcaataata ttgaaaaagg 1320 
aagagtatga gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc 1380 
cttcctgttt ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg 1440 
ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt 1500 
cgccccgaag aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta 1560 
ttatcccgta ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat 1620 
gacttggttg agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga 1680 
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gaattatgca gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca 1740 

acgatcggag gaccgaagga gctaaccgct tttttgcaca acatggggga tcatgtaact 1800 

cgccttgatc gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc 1360 

acgatgcctg tagcaatggc aacaacgttg cgcaaactat taactggcga actacttact 1920 

ctagcttccc ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt 1980 

ctgcgctcgg cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt 2040 

gggtctcgcg gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt 2100 

atctacacga cggggagtca ggcaactatg gatgaacgaa atagacagat cgctgagata 2160 

gatgcctcac tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag 2220 

attgatttaa aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat 2230 

ctcatgacca aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa 2340 

aagatcaaag gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca 2400 

aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt 2460 

ccgaaggtaa ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg 2520 

tagttaggcc accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc 2580 

ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga 2640 

cgatagttac cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc 2700 

agcttggagc gaacgaccta caccgaactg agatacctac agcgtgagct atgagaaagc 2760 

gccacgcttc ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca 2820 

ggagagcgca cgagggagct tccaggggga aacgcctggt atctttatag tcctgtcggg 2880 

tttcgccacc tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta 2940 

tggaaaaacg ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttttgct 3000 

cacatgttct ttcctgcgtt atcccctgat tctgtggata accgtattac cgcctttgag 3060 

tgagctgata ccgctcgccg cagccgaacg accgagcgca gcgagtcagt gagcgaggaa 3120 
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3128 



gcggaaga 



<210> 14 
<211> 3934 
<212> DNA 

<213> Artificial Sequence 

<220> , , , n 

<223> Description of Artificial Sequence: udp and deoD 
cloned downstream ptac promoter 

gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagaattcg 60 
agctccgaca tcataacggt tctggcaaat attctgaaat gagctgttga caattaatca 120 
tcggctcgta taatgtgtgg aattgtgagc ggataacaat ttcacacagg aggatcctag 180 
caggagggaa ttcttccatg gctaccccac acattaatgc agaaatgggc gatttcgctg 240 
acgtagtttt gatgccaggc gacccgctgc gtgcgaagta tattgctgaa actttccttg 300 
aagatgcccg tgaagtgaac aacgttcgcg gtatgctggg cttcaccggt acttacaaag 360 
gccgcaaaat ttccgtaatg ggtcacggta tgggtatccc gtcctgctcc atctacacca 420 
aagaactgat caccgatttc ggcgtgaaga aaattatccg cgtgggttcc tgtggcgcag 480 
ttctgccgca cgtaaaactg cgcgacgtcg ttatcggtat gggtgcctgc accgattcca 540 
aagttaaccg catccgtttt aaagaccatg actttgccgc tatcgctgac ttcgacatgg 600 
tgcgtaacgc agtagatgca gctaaagcac tgggtattga tgctcgcgtg ggtaacctgt 660 
tctccgctga cctgttctac tctccggacg gcgaaatgtt cgacgtgatg gaaaaatacg 720 
acattctcgg cgtggaaatg gaagcggctg gtatctacgg cgtcgctgca gaatttggcg 780 
cgaaagccct gaccatctgc accgtatctg accacatccg cactcacgag cagaccactg 840 
ccgctgagcg tcagactacc ttcaacgaca tgatcaaaat cgcactggaa tccgttctgc 900 
tgggcgataa agagtaagtc gacacaggaa acagctatga ccatgattac gaattcgagc 960 

t-ggtaccat ccatgtccaa gtctgatgtt tttcatctcg gcctcactaa aaacgattta 1020 
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caaggggcta 
ctgatggata 
ctggatggta 
gctgttgaag 
gctattcagc 
gatggcgcga 
acgactgcgc 
gcttcttctg 
gt tcgtcact 
atggaatctg 
gcgggtgtta 
accgaaagcc 
taagcttatg 
gacacccgcc 
acagacaagc 
cgaaacgcgc 
taataatggt 
tttgtttatt 



aaatgcttca 
ttattccctt 
aagtaaaaga 
acagcggtaa 
ttaaagttct 
gtcgccgcat 



cgcttgccat 
agccggttaa 
aacctgttat 
agctggcaca 
cgcatattaa 
gcctgcactt 
tggttgaagc 
ataccttcta 
ttaaaggttc 
caaccctgct 
tcgttaaccg 
atgcggtgaa 
gtgcactctc 
aacacccgct 
tgtgaccgtc 
gagacgaaag 
ttcttagacg 
tttctaaata 
ataatartga 
ttttgcggca 
tgctgaagat 
gatccttgag 
gctatgtggc 
acactatt ct 



Norphll . app 
cgtccctggc gacccggatc 
gctggcatct caccgcgaat 
cgtctgctct accggtatcg 
gctgggcatt cgcaccttcc 
tgtgggtgat gtcctggtta 
cgcaccgctg gaattcccgg 
tgcgaaatcc attggcgcga 
cccaggtcag gaacgttacg 
tatggaagag tggcaggcga 
gaccatgtgt gcaagtcagg 
cacccagcaa gagatcccga 
aatcgtggtg gaagcggcgc 
agtacaatct gctctgatgc 
gacgcgccct gacgggcttg 
tccgggagct gcatgtgtca 
ggcctcgtga tacgcctatt 
tcaggtggca cttttcgggg 
cattcaaata tgtatccgct 
aaaaggaaga gtatgagtat 
ttttgccttc ctgtttttgc 
cagttgggtg cacgagtggg 
agttttcgcc ccgaagaacg 
gcggtattat cccgtattga 



cagaatgact tggttgagta 
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gtgtggaaaa 
tcactacctg 
gcggcccgtc 
tgcgtatcgg 
ccacggcgtc 
ctgtcgctga 
caactcacgt 
atacttactc 
tgggcgtaat 
gcctgcgtgc 
atgctgagac 
gtcgtctgct 
cgcatagtta 
tctgctcccg 
gaggttttca 
tttataggtt 
aaatgtgcgc 
catgagacaa 
tcaacatttc 
tcacccagaa 
ttacatcgaa 
ttttccaatg 
cgccgggcaa 
ctcaccagtc 



gatcgccgcg 
gcgtgcagag 
tacctctatt 
tacaacgggc 
tgtccgtctg 
tttcgaatgt 
tggcgtgaca 
tggtcgcgta 
gaactatgaa 
cggtatggta 
gatgaaacaa 
gtaattctct 
agccagcccc 
gcatccgctt 
ccgtcatcac 
aatgtcatga 
ggaaccccta 
taaccctgat 
cgtgtcgccc 
acgctggtga 
ctggatctca 
atgagcactt 
gagcaactcg 
acagaaaagc 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
163 0 
1740 
1800 
1860 
192 0 
1980 
2040 
2100 



2160 
2220 
2280 
2 340 
2400 
2460 



Norphll . app 

atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata 2520 
acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt 2530 
tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag 2640 
ccataccaaa cgacgagcgt gacaccacga tgcctgtagc aatggcaaca acgttgcgca 2700 
aactattaac tggcgaacta cttactctag cttcccggca acaattaata gactggatgg 2760 
aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg 2820 
ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag 2880 
atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca actatggatg 2940 
aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag 3000 
accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa tttaaaagga 3060 
tc taggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt 3120 
tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc 3180 
tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttgtttgc 3240 
cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga gcgcagatac 3300 
caaatactgt ccttctagtg tagccgtagt taggccacca cttcaagaac tctgtagcac 3360 
cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt ggcgataagt 3420 
cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag cggtcgggct 3480 
gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc gaactgagat 3540 
acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt 3600 
atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg 3660 
cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt 3720 
gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc tttttacggt 3780 
tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc cctgattcrg 3840 
tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg 3900 
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Morphll . app 

3934 

agcgcagcga gtcagtgagc gaggaagcgg aaga 



<210> 15 
<211> 6046 
<212> DNA 

<213> Artificial Sequence 



< 2 4Z Description of Artificial Sequence: udp and deoD 
cloned downstream ptac promoter 

g'gcocaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagaattcg 60 
agctccgaca tcataacggt tctggcaaat attctgaaat gagctgttga caattaatca 120 
tcggctcgta taatgtgtgg aattgtgagc ggataacaat ttcacacagg aggatcctag 130 
caggagggaa ttcttccatg gctaccccac acattaatgc agaaatgggc gatttcgctg 2.0 
acgtagtttt gatgocaggc gacccgctgc gtgcgaagta tattgctgaa actttcctt, 300 
aagatgcccg tgaagtgaac aacgttcgcg gtatgctggg cttcaccggt acttacaaag 360 
occgcaaaat ttccgtaatg ggtcacggta tgggtatccc gtcctgctcc atctacacca 420 
aagaactgat caccgatttc ggcgtgaaga aaattatccg cgtgggttcc tgtggcgcag 480 
ttc-.gccgca cgtaaaactg cgcgacgtcg ttatcggtat gggtgcctgc accgattcca 540 
aagttaaccg catccgtttt aaagaccatg actttgccgc tatcgctgac ttcgacatgg 600 
cgcgtaacgc agtagatgca gctaaagcac tgggtattga tgctcgcgtg ggtaacctgt 660 
tctccgctga cctgttctac tctccggacg gcgaaatgtt cgacgtgatg gaaaaatacg 720 
gcattctcgg cgtggaaatg gaagcggctg gtatctacgg cgtcgctgca gaatttggcg 780 
=gaaagcc=t gaccatccgc accgtatctg accacatccg cactcacgag cagaccactg 840 
^otgagcg tcagactacc ttcaacgaca tgatcaaaat cgcactggaa tccgttctgc 900 
tgggcgataa agagtaagtc gacacaggaa acagctatga ccatgattac gaattcgagc 360 
tcggtaccat ccatgtccaa gtctgatgtt tttcatctcg gcctcactaa aaacgattta 1020 
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caaggggcta 
ctgatggata 
ctggatggta 
gctgttgaag 
gctattcagc 
gatggcgcga 
acgactgcgc 
gcttcttctg 
gttcgtcact 
atggaatctg 
gcgggtgtta 
accgaaagcc 
taagctttat 
tagggtcccc 
ggatcagctt 
ctattgcgat 
ttattatgca 
attatgtgct 
gaacgaattg 
acaaattctt 
ggacgcgatg 
ttgattggct 
caggtcgagg 
agggcggcgc 



cgcttgccat 
agccggttaa 
aacctgttat 
agctggcaca 
cgcatattaa 
gcctgcactt 
tggttgaagc 
ataccttcta 
ttaaaggttc 
caaccctgct 
tcgttaaccg 
atgcggtgaa 
gcttgtaaac 
aattaattag 
agtaaagccc 
aacaagaaaa 
cgcttaaaaa 
tagtgcatct 
ttagacatta 
ccaactgatc 
gatatgttct 
ccaattcttg 
tggcccggct 
ctacaatcca 



Norphll . app 
cgtccctggc gacccggatc 
gctggcatct caccgcgaat 
cgtctgctct accggtatcg 
gctgggcatt cgcaccttcc 
tgtgggtgat gtcctggtta 
cgcaccgctg gaattcccgg 
tgcgaaatcc attggcgcga 
cccaggtcag gaacgttacg 
tatggaagag tggcaggcga 
gaccatgtgt gcaagtcagg 
cacccagcaa gagatcccga 
aatcgtggtg gaagcggcgc 
cgttttgtga aaaaattttt 
taatataatc tattaaaggt 
tcgctagatt ttaatgcgga 
agccagcctt tcatgatata 
taataaaagc agacttgacc 
aacgcttgag ttaagccgcg 
tttgccgact accttggtga 
tgcgcgccga gatgcgccgc 
gccaagggtt ggtttgcgca 
gagtggtgaa tccgttagcg 
ccatgcaccg cgacgcaacg 



tgccaacccg ttccatgtgc 
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gtgtggaaaa 
tcactacctg 
gcggcccgtc 
tgcgtatcgg 
ccacggcgtc 
ctgtcgctga 
caactcacgt 
atacttactc 
tgggcgtaat 
gcctgcgtgc 
atgctgagac 
gtcgtctgct 
aaaataaaaa 
cattcaaaag 
tgttgcgatt 
tctcccaatt 
tgatagtttg 
ccgcgaagcg 
tctcgccttt 
gtgcggctgc 
ttcacagttc 
aggtgccgcc 
cggggaggca 
tcgccgaggc 



gatcgccgcg 
gcgtgcagag 
tacctctatt 
tacaacgggc 
tgtccgtctg 
tttcgaatgt 
tggcgtgaca 
tggtcgcgta 
gaactatgaa 
cggtatggta 
gatgaaacaa 
gtaattctct 
aggggacctc 
gtcatccacc 
acttcgccaa 
tgtgtagggc 
gctgtgagca 
gcgtcggctt 
cacgtagtgg 
tggagatggc 
tccgcaagaa 
ggcttccatt 
gacaaggtat 
ggcataaatc 



1080 
1140 



1200 
12 60 
1320 
1380 
1440 
1500 
15 60 
162 0 
168 0 
1740 
1300 
1360 
1920 
1980 
2040 
2100 
21 60 
2220 
2280 
2 340 
2400 
2460 



gccgtgacga 
tgaagctgtc 
atcccgatgc 
gcgaacgcca 
ttctcgccga 
attccgaata 
ccgaaaatga 
gtcataagtg 
aaggctctca 
cccagtagta 
atggcgccca 
ctcatgagcc 
ccagcaaccg 
cacaggacgg 
agcaggactg 
aattgcatca 
tggacgatat 
tccagggtga 
tgactgcgtt 
agggtttgca 
agggcaaggg 
gcgagcaggg 
ctaattaatt 
gtttacaagc 



tcagcggtcc 
cctgatggtc 
cgccggaagc 
gcaagacgta 
aacgtttggt 
ccgcaagcga 
cccagagcgc 
cggcgacgat 
agggcatcgg 
ggttgaggcc 
acagtccccc 
cgaagtggcg 
cacctgtggc 
gtgtggtcgc 
ggcggcggcc 
acgcatatag 
cccgcaagag 
cggtgccgag 
agcaatttaa 
actgcgggtc 
ctccaaggat 
gaattgatcc 
ggggacccta 
ataaagctta 



Norphll.app 

agtgatcgaa gttaggctgg 
gtcatctacc tgcctggaca 
gagaagaatc ataatgggga 
gcccagcgcg tcggccgcca 
ggcgggacca gtgacgaagg 
caggccgatc atcgtcgcgc 
tgccggcacc tgtcctacga 
agtcatgccc cgcgcccacc 
tcgacgctct cccttatgcg 
gttgagcacc gccgccgcaa 
ggccacgggg cctgccacca 
agcccgatct tccccatcgg 
gccggtgatg ccggccacga 
catgatcgcg tagtcgatag 
aaagcggtcg gacagtgctc 
cgctagcagc acgccatagt 
gcccggcagt accggcataa 
gatgacgatg agcgcattgt 
ctgtgataaa ctaccgcatt 
aaggatctgg atttcgatca 
cgggccttga tgttacccga 
ggtggatgac cttttgaatg 
gaggtcccct tttttatttt 



tggtgcactc tsagtacaat 
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taagagccgc 
gcatggcctg 
aggccatcca 
tgccggcgat 
cttgagcgag 
tccagcgaaa 
gttgcatgat 
ggaaggagct 
actcctgcat 
ggaatggtgc 
tacccacgcc 
tgatgtcggc 
tgcgtccggc 
tggctccaag 
cgagaacggg 
gactggcgat 
ccaagcctat 
tagatttcat 
aaagctcatg 
cggcacgatc 
gagcttggca 
acc 

tttaata 
aaaaattttt 
ctgctctgat 



gagcgatcct 
caacgcgggc 
gcctcgcgtc 
aatggcctgc 
ggcgtgcaag 
gcggtcctcg 
aaagaagaca 
gactgggttg 
taggaagcag 
atgcaaggag 
gaaacaagcg 
gatataggcg 
gtagaggatc 
tagcgaagcq 
tgcgcataga 
gctgtcggaa 
gcctacagca 
acacggtgcc 
cggatcagtq 
atcgtgcggg 
cccagcctgc 
gattatatta 
tcacaaaacg 
gccgcatagt 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3 600 
3660 
37 20 
37 8 0 
3840 
3 900 



Norphll-app 

taag c= ag cc ccacaccc, =caacaccc g ct g ac g c g cc et g ac ggg c t t gt ct gC tcc 3.60 
cgg catc C9 c tt aca g acaa g c tg t g acc g tctcc ggg a g c tg ca tgtg t ca g a gg «tt ,020 
ca cc gt ca t c acc g aaac g c g c g a g ac g aa a 9gg c=tc 9 t g atac g cc t a tttttat.,, «30 
ttaat g tc.t g ataataat g gtt tc t ta,a = 9 tca ggt99 cac«ttc 9g g9 aaa t9 t 9 c 4140 
gcgg aacccc tatttgttta tttttctaaa taca.tcaaa ta tgt atcc 9 =tca tg a g ac 4200 
aat aa=cct 9 ataaat g ct t caataatatt g aaaa agg a a g a g ta tg a g t attcaacatt 4260 
tc .- s t,tc,c ccttattccc ttttt t9 c gg catt« 9 oct tc=t g ttttt g ctcaccca g 4320 
aaac g ct gg t g aaa gt aaaa g a tg c tg aa g a t ca gttgg9 t9 cac 9 a 9 t g gg ttacatc g 4380 
aac tgg atct ca a =a g c g9 t aa g a t c rt t g a 9 a 9 tt t tc g cccc g a ag aa c grt ttccaa 4440 
tg ataa g =a= ttttaaa gt t c tg cta t9 t 9 g c g c, g tatt atccc.tatt g ac g cc ggg c 4 500 
aaMMt c 9gt c 9 c= 9 c a t acac t a tt ctca g aa t9 a c tt9gt t g a 9 tactcacca, 4560 
tC aca g aaaa g catc tta c g g at gg c atga ca 9 taa 9 a 9 a attat g ca g t g ct g ccataa 4620 
cca tg a gt9 a taacac tg = 9 g cc,a«ta= t t =t 9 a=aac g a« g9 a 9g a cc g aa gg a g c 46,0 
ta.cc.cttt ttt g cacaa= at ggg99 atc at g taactc g cctt g atc g t t999 aacc 9g 4740 
a g ct g aat g a a g ccata=ca aac,ac„c g t g acaccac g at g cct 9 ta 9 caat 9g caa 4800 
caa c 9t t 9cg caaactatta act gg c 9 aac tacttactct a 9 cttccc 9g caacaattaa 4860 
ta 9 act gg at 9g a g9 c 9g at aaa gt t g ca g 9 accacttct 9 c 9 ctc 99 cc cttcc 9g ct 9 4920 
gctgg tttat t g ct g ataaa tct gg a 9 cc g g t 9 a g c g t gg g tctc g c g9 t atcatt 9 ca g 4,80 
cact gg99 cc a 9 at gg taa g c=ctccc g ta tc 9 ta 9 ttat ctaca.ac, g99 a g tca gg 5040 
ca a ctat 99 a t g aac g a,at a g ac ag atc 9 ct g a g ata 9g t g cctcact g attaa g catt 5.00 
99 taact g tc a g accaa 9 tt tactcatata tacttta g at t g attta.aa cttcattttt 5.60 
a atttaaaa g 9 atcta 9gtg aa.atccttt tt g ataatct cat g ac=a.a atcccttaac 5220 

^.nxrc crataqaaaa gatcaaagga tcttcttgag 5-30 
gtgagttttc gttccactga gcgtcagacc ccgtagaa y 

, ,t r M-ac^ tgcaaacaaa aaaaccaccg ctaccagcgg 5^40 
atcctttttt tctgcgcgta atcug^gc. rgca 

Page 54 



Norphll . app 

tggtttgttt gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca 5400 
gagcgcagat accaaatact gtccttctag tgtagccgta gttaggccac cacttcaaga 5460 
actctgtagc accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca 5520 
gtggcgataa gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc 5580 
agcggtcggg ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca 5640 
ccgaactgag atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa 5700 
aggcggacag gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc 5760 
cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc 5820 
gtcgattttt gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg 5880 
cctttttacg gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat 5940 
cccctgattc tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca 6000 
gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc ggaaga 6046 
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